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(57) Abstract: Disclosed is a family of P450 monooxygenases, each member of which regio selectively oxidizes avermectin to 
4"-keto-avermectin. The P450 monooxgenases find use in methods and formulations for making emamectin from avermectin. Also 
disclosed are methods for purifying the P450 monooxygenases of the invention, binding agents that specifically bind to the P450 
monooxygenases of the invention, and genetically engineered cells that express the P450 monooxygenases of the invention. Also 
disclosed are ferrodoxins and ferredoxin reductases that are active with the P450 monooxygenases of the invention. 
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METHODS AND COMPOSITIONS FOR MAKING EMAMECTIN 

The invention relates to the field of agrochemicals, and in particular, to insecticides. 
More specifically, this invention relates to the derivatization of avermectin, particularly for the 
synthesis of emamectin. 

Emamectin is a potent insecticide and controls many pests such as thrips, leafminers, 
and worm pests including alfalfa caterpillar, beet armyworm, cabbage looper, corn earworm, 
cutworms, diamondback moth, tobacco budworm, tomato fruitworm, and tomato pin worm. 
Emamectin (4"-deoxy-4"-epi-N-methylamino avermectin Bla/Blb) is described in U.S. Patent 
No. 4,874,749 and in Cvetovich, RJ. et al, J. Organic Chem. 59:7704-7708, 1994 (as MK- 
244). 

U.S. Patent No. 5,288,710 describes salts of emamectin that are especially valuable 
agrochemically. These salts of emamectin are valuable pesticides, especially for combating 
insects and representatives of the order Acarina. Some pests for which emamectin is useful in 
combating are listed in European Patent Application EP-A 736,252. 

One drawback to the use of emamectin is the difficulty of its synthesis from avermectin. 
This is due to the first step of the process, which is the most costly and time-consuming step 
of producing emamectin, in which the 4"-carbinol group of avermectin must be oxidized to a 
ketone. The oxidation of the 4"-carbinol group is problematic due to the presence of two other 
hydroxyl groups on the molecule that must be chemically protected before oxidation and 
deprotected after oxidation. Thus, this first step, significantly increases the overall cost and 
time of producing emamectin from avermectin. 

Because of the efficacy and potency of emamectin as an insecticide, there is a need to 
develop a cost and time effective method and/or reagent for regioselectively oxidizing the 4"- 
carbinol group of avermectin to produce 4"-keto-avermectin, which is a necessary 
intermediate for producing emamectin from avermectin. 

The invention provides a novel family of P450 monooxygenases, each member of which 
is able to regioselectively oxidize the 4"-carbinol group of unprotected avermectin, thereby 
resulting in a cheap, effective method to produce 4"-keto-avermectin, a necessary intermediate 
in the production of emamectin. The invention allows elimination of the costly, time- 
consuming steps of (1) chemically protecting the two other hydroxyl groups on the avermectin 
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molecule prior to oxidation of the 4"-carbinoI group that must be chemically protected before 
oxidation; and (2) chemically deprotecting these two other hydroxyl groups after oxidation. 
The invention thus provides reagents and methods for significantly reducing the overall cost 
of producing emamectin from avermectin. 

■ Accordingly, in one aspect, the invention provides a purified nucleic acid molecule 
encoding a polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and 
regioselectively oxidizes avermectin to 4"-keto-avermectin. 

■ In a specific embodiment , the invention relates to an purified nucleic acid molecule 
comprising a nucleotide sequence encoding a polypeptide that exhibits an enzymatic 
activity of a P450 monooxygenase and regioselectively oxidizes avermectin to 4"-keto- 
avermectin, which polypeptide is substantially similar, and preferably has between at least 
50%, and 99% amino acid sequence identity to the polypeptide of SEQ ID NO:2, with 
each individual number within this range of between 50% and 99% also being part of the 
invention. 

■ In a further specific embodiment , the invention relates to an purified nucleic acid 
molecule comprising a nucleotide sequence encoding a polypeptide that exhibits an 
enzymatic activity of a P450 monooxygenase and regioselectively oxidizes avermectin to 
4"-keto-avermectin, which polypeptide is immunologically reactive with antibodies raised 
against a polypeptide of SEQ ID NO:2. 

■ The invention further provides a purified nucleic acid molecule comprising a 
nucleotide sequence 

a) as given in SEQ ID NO: 1 ; 

b) having substantial similarity to (a); 

c) capable of hybridizing to (a) or the complement thereof; 

d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO:l, or the 
complement thereof; 

e) complementary to (a), (b) or (c); 

f) which is the reverse complement of (a), (b) or (c), or 



2 



WO 02/092801 



PCT/EP02/05363 



g) which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that still 
exhibits an enzymatic activity of a P450 monooxygenase and regioselectively oxidizes 
avermectin to 4"-keto-avermectin. 

In a specific embodiment , the invention relates to a purified nucleic acid molecule 
comprising a nucleotide sequence encoding a polypeptide that exhibits an enzymatic 
activity of a P450 monooxygenase and regioselectively oxidizes avermectin to 4"-keto- 
avermectin, which polypeptide is substantially similar, and preferably has at least between 
60%, and 99% amino acid sequence identity to the polypeptide of SEQ ID NO:2, SEQ ID 
NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, 
SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID 
NO:95, with each individual number within this range of between 60% and 99% also being 
part of the invention. 

In a further specific embodiment , the invention relates to an purified nucleic acid 
molecule comprising a nucleotide sequence encoding a polypeptide that exhibits an 
enzymatic activity of a P450 monooxygenase and regioselectively oxidizes avermectin to 
4"-keto-avermectin, which polypeptide is immunologically reactive with antibodies raised 
against a polypeptide of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ED NO:8, SEQ 
ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID 
NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, 
SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

The invention further provides a purified nucleic acid molecule comprising a 
nucleotide sequence 

a) as given in SEQ ID NO:l, SEQ ED NO:3, SEQ ED NO:5, SEQ ID NO:7, SEQ ED 
NO:9, SEQ ED NO: 11, SEQ ID NO: 13, SEQ ED NO: 15, SEQ ID NO: 17, SEQ ID 
NO:19, SEQ ED NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID NO:94; 

b) having substantial similarity to (a); 

c) capable of hybridizing to (a) or the complement thereof; 

d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO:l, SEQ ID 
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NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 11, SEQ ID NO:13, 
SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or 
SEQ ID NO:94 or the complement thereof; 

e) complementary to (a), (b) or (c); 

f) which is the reverse complement of (a), (b) or (c); or 

g) which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that still 
exhibits an enzymatic activity of a P450 monooxygenase and regioselectively oxidizes 
avermectin to 4"-keto~avermectin. 

In certain embodiments, the nucleic acid molecule comprises or consists essentially of 
a nucleic acid sequence that is at least between 66%, and 99% identical to SEQ ID NO: I, 
with each individual number within this range of between 66%, and 99% also being part of 
the invention.. 

In certain embodiments, the nucleic acid molecule comprises or consists essentially of 
a nucleic acid sequence that is at least between 70%, and 99% identical to SEQ ID NO:l, 
SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO.ll, SEQ ID 
NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ 
ID NO:94, with each individual number within this range of between 70%, and 99% also 
being part of the invention.. 

In some embodiments, the nucleic acid molecule comprises or consists essentially of a 
nucleic acid sequence that is at least 80% identical to SEQ ID NO:l, SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO: 13, SEQ ID NO: 15, 
SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID 
NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID NO:94. 

In certain embodiments, the nucleic acid molecule comprises or consists essentially of 
a nucleic acid sequence that is at least 90% identical to SEQ ID NO:l, SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, SEQ ID NO:15, 
SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID 
NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID NO:94. 
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In certain embodiments, the nucleic acid molecule comprises or consists essentially of 
a nucleic acid sequence that is at least 95% identical to SEQ ID NO:l, SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ED NO:9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, 
SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID 
NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID NO:94. 

In some embodiments, the nucleic acid molecule comprises or consists essentially of a 
nucleic acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:3, 
SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO: 13, SEQ ID 
NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, 
SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, and SEQ ID NO:94. 

In particular embodiments, the nucleic acid molecule is isolated from a Streptomyces 
strain. In certain embodiments, the Streptomyces strain is selected from the group 
consisting of Streptomyces tubercidicus, Streptomyces lydicus, Streptomyces platensis, 
Streptomyces chattanoogensis, Streptomyces kasugaensis, and Streptomyces rimosus and 
Streptomyces albofaciens. . 

In some embodiments of this aspect, the nucleic acid molecule further comprises a 
nucleic acid sequence encoding a tag which is linked to the P450 monooxygenase via a 
covalent bond. In certain embodiments, the tag is selected from the group consisting of a 
His tag, a GST tag, an HA tag, a HSV tag, a Myc-tag, and VSV-G-Tag. 

In another aspect, the invention provides a purified polypeptide that exhibits an 
enzymatic activity of a P450 monooxygenase and regioselectively oxidizes avermectin to 
4"-keto-avermectin . 

In some embodiments, the polypeptide comprises or consists essentially of an amino 
acid sequence that is encoded by a nucleic acid molecule 

a) as given in SEQ ID NO: 1 or the complement thereof; 

b) having substantial similarity to (a); 

c) capable of hybridizing to (a) or the complement thereof; 

d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO:l, or the 
complement thereof ; 

e) complementary to (a), (b) or (c); 
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f) which is the reverse complement of (a), (b) or (c); or. 

g) which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that still 
exhibits an enzymatic activity of a P450 monooxygenase and regioselectively oxidizes 
avermectin to 4"-keto 7 avermectin. 

In some embodiments, the polypeptide comprises or consists essentially of an amino 
acid sequence that is between at least 50%, and 99% identical to SEQ ID NO:2, with each 
individual number within this range of between 50% and 99% also being part of the 
invention.. 

In some embodiments, the polypeptide comprises or consists essentially of an amino 
acid sequence that is encoded by a nucleic acid molecule 

a) as given in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:9, SEQ ID NOrll, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID 
NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID NO:94 or the complement thereof; 

b) having substantial similarity to (a); 

c) capable of hybridizing to (a) or the complement thereof; 

d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NOrll, SEQ ID NO:13, 
SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or 
SEQ ID NO: 94 or the complement thereof, or the complement thereof; 

e) complementary to (a), (b) or (c); 

f) which is the reverse complement of (a), (b) or (c); or 

g) which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that still 
exhibits an enzymatic activity of a P450 monooxygenase and regioselectively oxidizes 
avermectin to 4"-keto-avermectin. 

In some embodiments, the P450 monooxygenase comprises or consists essentially of 
an amino acid sequence that is between at least 60%, and 99% identical to SEQ ID NO:2, 
SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID 
NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, 
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SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ 
ID NO:95, with each individual number within this range of between 60% and 99% also 
being part of the invention.. 

In certain embodiments, the P450 monooxygenase comprises or consists essentially of 
an amino acid sequence that is at least 70% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, 
SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

In some embodiments, the P450 monooxygenase comprises or consists essentially of 
an amino acid sequence that is at least 80% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ 
3D NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, 
SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

In some embodiments, the P450 monooxygenase comprises or consists essentially of 
an amino acid sequence that is at least 90% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:6, SEQ ID NO:8, SEQ ID NO.10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, 
SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

In certain embodiments, the P450 monooxygenase comprises or consists essentially of 
an amino acid sequence that is at least 95% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, 
SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

In some embodiments of this aspect of the invention, the P450 monooxygenase 
comprises or consists essentially of an amino acid sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, 
SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID 
NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, 
SEQ ID NO:34, and SEQ ID NO:95. 

In certain embodiments, the polypeptide according to the invention exhibiting an 
enzymatic activity of a P450 monooxygenase further comprises a tag. In some 
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embodiments, the tag is selected from the group consisting of a His tag, a GST tag, an HA 
tag, a HSV tag, a Myc-tag, and VS V-G-Tag. 

In another aspect, the invention provides a binding agent that specifically binds to a 
polypeptide according to the invention exhibiting an enzymatic activity of a P450 
monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin. In some 
embodiments, the binding agent is an antibody. In certain embodiments, the antibody is a 
polyclonal antibody or a monoclonal antibody. 

In yet another aspect, the invention provides a family of P450 monooxygenase 
polypeptides, wherein each member of the family regioselectively oxidizes avermectin to 
4"-keto-avermectin . 

In certain embodiments, each member of the family comprises or consists essentially 
of an amino acid sequence that is between at least 50%, and 99% identical to SEQ ID 
NO:2, with each individual number within this range of between 50% and 99% also being 
part of the invention.. 

In certain embodiments, each member of the family comprises or consists essentially 
of an amino acid sequence that is between at least 60%, and 99% identical to SEQ ID 
NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ 
ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID 
NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, 
or SEQ ID NO:95, with each individual number within this range of between 60% and 
99% also being part of the invention.. 

In some embodiments, each member of the family comprises or consists essentially of 
an amino acid sequence that is at least 70% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, 
SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

In certain embodiments, each member of the family comprises or consists essentially 
of an amino acid sequence that is at least 80% identical to SEQ ID NO:2, SEQ ID NO:4, 
SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID 
NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, 
SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. In 
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some embodiments, each member of the family comprises or consists essentially of an 
amino acid sequence that is at least 90% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ ID 
NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, 
SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID 
NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

In certain embodiments, each member of the family comprises or consists essentially 
of an amino acid sequence that is at least 95% identical to SEQ ID NO:2, SEQ ID NO:4, 
SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID 
NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, 
SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

In some embodiments of this aspect of the invention, each member of the family 
comprises or consists essentially of an amino acid sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, 
SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID 
NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, 
SEQ ID NO:34, and SEQ ID NO:95. 

In still another aspect, the invention provides a cell genetically engineered to comprise 
a nucleic acid molecule encoding a polypeptide which exhibits an enzymatic activity of a 
P450 monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin. 

In some embodiments, the nucleic acid molecule is positioned for expression in the 
cell. In certain embodiments, the cell further comprises a nucleic acid molecule 
comprising a nucleotide sequence encoding a polypeptide according to the invention 
exhibiting an enzymatic activity of a ferredoxin protein. 

In some embodiments, the cell further comprises a nucleic acid molecule comprising a 
nucleotide sequence encoding a polypeptide according to the invention exhibiting an 
enzymatic activity of a ferredoxin reductase protein. 

In certain embodiments, the cell is a genetically engineered Streptomyces strain. In 
certain embodiments, the cell is a genetically engineered Streptomyces lividans strain. In 
particular embodiments, the genetically engineered Streptomyces lividans strain has NRRL 
Designation No. B-30478. In some embodiments, the cell is a genetically engineered 
Pseudomonas strain. In some embodiments, the cell is a genetically engineered 



9 



WO 02/092801 



PCT/EP02/05363 



Pseudomonas putida strain. In certain embodiments, the genetically engineered 
Pseudornonas putida strain has NRRL Designation No. B-30479. In some embodiments, 
the cell is a genetically engineered Escherichia coli strain. 

In another aspect, the invention provides a purified nucleic acid molecule comprising a 
nucleotide sequence encoding a polypeptide according to the invention exhibiting an 
enzymatic activity of a ferredoxin, wherein the nucleic acid molecule is isolated from a 
Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes 
avermectin to 4"-keto-avermectin. 

In a specific embodiment , the invention relates to an purified nucleic acid molecule 
comprising a nucleotide sequence encoding a polypeptide that exhibits the enzymatic 
activity of a ferredoxin, which polypeptide is substantially similar, and preferably has 
between at least 80%, and 99% amino acid sequence identity to the polypeptide of SEQ ID 
NO:36 or SEQ ID NO: 38, with each individual number within this range of between 80% 
and 99% also being part of the invention. 

In still a further specific embodiment , the invention relates to an purified nucleic acid 
molecule comprising a nucleotide sequence encoding a polypeptide that exhibits the 
enzymatic activity of a ferredoxin, which polypeptide is immunologically reactive with 
antibodies raised against a polypeptide of SEQ ID NO: 36 or SEQ ID NO: 38. 

The invention further provides a purified nucleic acid molecule comprising a 
nucleotide sequence 

a) as given in SEQ ID NO:35 or SEQ ID NO: 37; 

b) having substantial similarity to (a); 

c) capable of hybridizing to (a) or the complement thereof; 

d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO: 35 or SEQ ID 
NO: 37, or the complement thereof; 

e) complementary to (a), (b) or (c); 

f) which is the reverse complement of (a), (b) or (c); or 

g) which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that still 
exhibits an enzymatic activity of a ferredoxin and regioselectively oxidizes avermectin 
to 4"-keto-avermectin . 
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In certain embodiments, the nucleic acid molecule encoding a ferredoxin of the 
invention comprises or consists essentially of a nucleic acid sequence that is at least 81% 
identical to SEQ ID NO:35 or SEQ ID NO:37. In some embodiments, the nucleic acid 
molecule comprises or consists essentially of a nucleic acid sequence that is at least 85%, 
or at least 90%, or at least 95%, or at least 99% identical to SEQ ID NO:35 or SEQ ID 
NO:37. In certain embodiments, the nucleic acid molecule encoding a ferredoxin of the 
invention comprises or consists essentially of the nucleic acid sequence of SEQ ID NO:35 
or SEQ ID NO:37. 

In yet another aspect, the invention provides a purified ferredoxin protein, wherein the 
ferredoxin protein is isolated from a Streptomyces strain comprising a P450 
monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin. In certain 
embodiments, the ferredoxin of the invention comprises or consists essentially of an amino 
acid sequence that is at least 80% identical to SEQ ID NO:36 or SEQ ID NO:38. In some 
embodiments, the nucleic acid molecule comprises or consists essentially of an amino acid 
sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to 
SEQ ID NO:36 or SEQ ID NO:38. 

In particular embodiments, the ferredoxin of the invention comprises or consists 
essentially of the amino acid sequence of SEQ ID NO:36 or SEQ ID NO:38. 

In another aspect, the invention provides a purified nucleic acid molecule comprising a 
nucleotide sequence encoding a polypeptide according to the invention exhibiting an 
enzymatic activity of a ferredoxin reductase, wherein the nucleic acid molecule is isolated 
from a Streptomyces strain comprising a P450 monooxygenase that regioselectively 
oxidizes avermectin to 4"-keto-avermectin. 

In certain embodiments, the nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide according to the invention exhibiting an enzymatic activity of a 
ferredoxin reductase comprises or consists essentially of the nucleic acid sequence of SEQ 
ID NO:98, SEQ ID NO:100, SEQ ID NO:102, or SEQ ID NO:104. 

In yet another aspect, the invention provides a purified polypeptide exhibiting an 
enzymatic activity of a ferredoxin reductase protein, wherein the said polypeptide is 
isolated from a Streptomyces strain comprising a P450 monooxygenase that 
regioselectively oxidizes avermectin to 4"-keto-avermectin. In certain embodiments, the 
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polypeptide of the invention comprises or consists essentially of the amino acid sequence 
of SEQ ID NO:99, SEQ ID NO:101, SEQ ID NO:103, or SEQ ID NO:105. 

In another aspect, the invention provides a process for the preparation a compound of 
the formula 



in which 

R1-R9 represent, independently of each other hydrogen or a substituent; 

m is 0, 1 or 2; 

n is 0, 1, 2 or 3; and 

the bonds marked with A, B, C, D, E and F indicate, independently of each other, that 
two adjacent carbon atoms are connected by a double bond, a single bond, a single bond 
and a epoxide bridge of the formula 




H 




, or a single bond and a methylene bridge of the formula 
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including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer 

thereof, in each case in free form or in salt form, 

which process comprises 

1) bringing a compound of the formula 




wherein 

R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, 
into contact with a polypeptide according to the invention that is capable of 
regioselectively oxidising the alcohol at position 4" in order to form a compound of the 
formula 
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in which Rj, R 2 , R3, R4, R5, R6, R7, m, n, A, B, C, D, E and F have the meanings given 
for formula (I); and 

2) reacting the compound of the formula (HI) with an amine of the formula HN(R 8 )R 9 , 
wherein R 8 and R 9 have the same meanings as given for formula (I), and which is 
known, in the presence of a reducing agent; 

and, in each case, if desired, converting a compound of formula (I) obtainable in 
accordance with the process or by another method, or an E/Z isomer or tautomer thereof, 
in each case in free form or in salt form, into a different compound of formula (I) or an 
E/Z isomer or tautomer thereof, in each case in free form or in salt form, separating a 
mixture of E/Z isomers obtainable in accordance with the process and isolating the 
desired isomer, and/or converting a free compound of formula (I) obtainable in 
accordance with the process or by another method, or an E/Z isomer or tautomer thereof, 
into a salt or converting a salt, obtainable in accordance with the process or by another 
method, of a compound of formula (I) or of an E/Z isomer or tautomer thereof into the 
free compound of formula (I) or an E/Z isomer or tautomer thereof or into a different 
salt. 

In some embodiments, the compound of formula (II) is further brought into contact 
with a polypeptide according to the invention exhibiting an enzymatic activity of a 
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ferredoxin. In certain embodiments, the compound of formula (II) is further brought into 
contact with a polypeptide according to the invention exhibiting an enzymatic activity of a 
ferredoxin reductase. In some embodiments, the compound of formula (II) is further 
brought into contact with a reducing agent (e.g., NADH or NADPH). 

In still a further embodiment, the invention provides a process for the preparation of a 
compound of the formula 




in which Ri, R2, R3, R4, R5, Re? R7, m, n, A, B, C, D, E and F have the meanings given for 
formula (I) of claim 1, 
which process comprises 

1) bringing a compound of the formula 
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wherein 

R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, 
into contact with a polypeptide according to the invention that is capable of 
regioselectively oxidising the alcohol at position 4", maintaining said contact for a time 
sufficient for the oxidation reaction to occur and isolating and purifying the compound 
of formula (II). 

In yet another embodiment, the invention provides a process according to the invention 
for the preparation of a compound of the formula (I), in which 
n is 1; 
m is 1; 

A is a double bond; 

B is single bond or a double bond, 

C is a double bond, 

D is a single bond, 

E is a double bond, 

F is a double bond; or a single bond and a epoxy bridge; or a single bond and a 
methylene bridge; 
Ri, R 2 and R 3 are H; 
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R 4 is methyl; 

R 5 is Ci-Cio-alkyl, C 3 -C 8 -cycloalkyl or C 2 -Cio-alkenyl; 
R 6 is H; 
R 7 is OH; 

R 8 and R 9 are independently of each other H; Ci-Cio-alkyl or Ci-Cio-acyl; or together 
form -(CH?) q -; and 
q is 4, 5 or 6. 

In still another embodiment, the invention provides a process according to the 
invention for the preparation of a compound of the formula (I), in which 
n is 1; 
m is 1; 

A, B, C, E and F are double bonds; 

D is a single bond; 

Ri, R 2 , and R 3 are H; 

R 4 is methyl; 

Rs is s-butyl or isopropyl; 

R 6 is H; 

R 7 is OH; 

Rg is methyl 

R 9 is H. 

In still another embodiment, the invention provides a process according to the 
invention for the preparation of 4"-deoxy-4"-N-methylamino avermectin Bi a /Bi b benzoate 
salt. 

In another aspect, the invention provides a method for making emamectin. The 
method comprises adding a polypeptide according to the invention exhibiting an enzymatic 
activity of a P450 monooxygenase that regioselectively oxidizes avermectin to 4"-keto- 
avermectin to a reaction mixture comprising avermectin and incubating the reaction 
mixture under conditions that allow the polypeptide to regioselectively oxidize avermectin 
to 4"-keto-avermectin. In some embodiments, the reaction mixture further comprises a 
polypeptide according to the invention exhibiting an enzymatic activity of a ferredoxin. In 
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certain embodiments, the reaction mixture further comprises a polypeptide according to the 
invention exhibiting an enzymatic activity of a ferredoxin reductase. In some 
embodiments, the reaction mixture further comprises a reducing agent (e.g., NADH or 
NADPH). 

■ In still another aspect, the invention provides a formulation for making a compound of 
formula (I) comprising a polypeptide according to the invention exhibiting a P450 
monooxygenase activity that is capable of regioselectively oxidising the alcohol at position 
4" in order to form a compound of formula (II). In some embodiments, the formulation 
further comprises a polypeptide according to the invention exhibiting an enzymatic activity 
of a ferredoxin (e.g., a ferredoxin from cell or strain from which the P450 monooxygenase 
was isolated or derived). 

■ In still another aspect, the invention provides a formulation for making emamectin 
comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4"-keto- 
avermectin. In some embodiments, the formulation further comprises a ferredoxin (e.g., a 
ferredoxin from cell or strain from which the P450 monooxygenase was isolated or 
derived). 

■ In certain embodiments, the formulation further comprises a polypeptide according to 
the invention exhibiting an enzymatic activity of a ferredoxin reductase (e.g., a ferredoxin 
from cell or strain from which the P450 monooxygenase was isolated or derived). In some 
embodiments, the formulation further comprises a reducing agent (e.g., NADH or 
NADPH). 

Brief Description of the Drawings 

Figure 1 is a diagrammatic representation showing a map of plasmid pTBBKA. 
Recognition sites by the indicated restriction endonucleases are shown, along with the 
location of the site in the nucleotide sequence of the plasmid. Also shown are genes (e.g., 
kanamycin resistance "KanR"), and other functional aspects (e.g., Tip promoter) contained in 
the plasmid. 

Figure 2 is a diagrammatic representation showing a map of plasmid pTUAl A. 
Recognition sites by the indicated restriction endonucleases are shown, along with the 



18 



WO 02/092801 



PCT/EP02/05363 



location of the site in the nucleotide sequence of the plasmid. Also shown are genes {e.g., 
ampicillin resistance "AmpR") and other functional aspects {e.g., Tip promoter) contained in 
the plasmid. 

Figure 3 is a diagrammatic representation showing a map of plasmid pRK-emal/fd233. 
This plasmid was derived by ligating a Bglll fragment containing the emal and fd233 genes 
organized on a single transcriptional unit into the Bgin site of the broad host-range plasmid 
pRK290. The emal/fd233 genes are expressed by the tac promoter (Ptac), and they are 
followed by the tac terminator (Ttac). Restriction endonuclease recognition sites shown are 
BglH "B"; EcoRI "E"; Pad "Pc"; Pmel "Pm"; and Sail "S." 

The present invention provides a family of polypeptides which exhibit an 
enzymatic activity of a P450 monooxygenases and are capable of regioselectively oxidizing 
the alcohol at position 4" of a compound of formular (II) such as avermectin in order to 
produce a compound of the formula (III), but especially 4"-keto-avermectin. 

More particularly, the family of polypeptides according to the invention may be used in 
a process for the preparation a compound of the formula 




in which 

R1-R9 represent, independently of each other hydrogen or a substituent; 
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m is 0, 1 or 2; 

n is 0, 1, 2 or 3; and 

the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two 
adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a 
epoxide bridge of the formula 

H O H 

, or a single bond and a methylene bridge of the formula 

including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer 
thereof, in each case in free form or in salt form, 
which process comprises 

1) bringing a compound of the formula 




wherein 

R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, 
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into contact with a polypeptide according to the invention which exhibits an enzymatic 
activity of a P450 monooxygenases and is capable of regioselectively oxidizing the alcohol at 
position 4" of formular (II) in order to produce a compound of the formula (HI) 




Cm), 



in which Ri, R2, R3, R4, R5> R6, R7> m, n, A, B, C, D, E and F have the meanings given for 
formula (I); and 

2) reacting the compound of the formula (HO with an amine of the formula HN(R 8 )R 9 , 
wherein R 8 and R 9 have the same meanings as given for formula (I), and which is known, in 
the presence of a reducing agent; . 

and, in each case, if desired, converting a compound of formula (I) obtainable in accordance 
with the process or by another method, or an E/Z isomer or tautomer thereof, in each case in 
free form or in salt form, into a different compound of formula (I) or an E/Z isomer or 
tautomer thereof, in each case in free form or in salt form, separating a mixture of E/Z isomers 
obtainable in accordance with the process and isolating the desired isomer, and/or converting 
a free compound of formula (I) obtainable in accordance with the process or by another 
method, or an E/Z isomer or tautomer thereof, into a salt or converting a salt, obtainable in 
accordance with the process or by another method, of a compound of formula (I) or of an E/Z 
isomer or tautomer thereof into the free compound of formula (I) or an E/Z isomer or tautomer 
thereof or into a different salt. 
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Methods of synthesis for the compounds of formula (I) are described in the literature. It 
has been found, however, that the processes known in the literature cause considerable 
problems during production basically on account of the low yields and the tedious procedures 
which have to be used. Accordingly, the known processes are not satisfactory in that respect, 
giving rise to the need to make available improved preparation processes for those 
compounds. 

The compounds (I), (II) and (DDT) may be in the form of tautomers. Accordingly, herein- 
before and hereinafter, where appropriate the compounds (I), (II) and (III) are to be understood 
to include corresponding tautomers, even if the latter are not specifically mentioned in each 
case. 

The compounds (I), (II) and (HI) are capable of forming acid addition salts. Those salts 
are formed, for example, with strong inorganic acids, such as mineral acids, for example 
perchloric acid, sulfuric acid, nitric acid, nitrous acid, a phosphoric acid or a hydrohalic acid, 
with strong organic carboxylic acids, such as unsubstituted or substituted, for example halo- 
substituted, Ci-C4alkanecarboxylic acids, for example acetic acid, saturated or unsaturated 
dicarboxylic acids, for example oxalic, malonic, succinic, maleic, fumaric or phthalic acid, 
hydroxycarboxylic acids, for example ascorbic, lactic, malic, tartaric or citric acid, or benzoic 
acid, or with organic sulfonic acids, such as unsubstituted or substituted, for example halo- 
substituted, Ci-C 4 alkane- or aryl-sulfonic acids, for example methane- or p-toluene-sulfonic 
acid. Furthermore, compounds of formula (I), (II) and (m) having at least one acidic group 
are capable of forming salts with bases. Suitable salts with bases are, for example, metal salts, 
such as alkali metal or alkaline earth metal salts, for example sodium, potassium or 
magnesium salts, or salts with ammonia or an organic amine, such as morpholine, piperidine, 
pyrrolidine, a mono-, di- or tri-lower alkylamine, for example ethyl-, diethyl-, triethyl- or 
di methyl -propyl - ami n e , or a mono-, di- or tri-hydroxy-lower alkylamine, for example mono-, 
di- or tri-ethanolamine. In addition, corresponding internal salts may also be formed. Prefe- 
rence is given within the scope of the invention to agrochemically advantageous salts. In view 
of the close relationship between the compounds of formula (I), (II) and (III) in free form and 
in the form of their salts, any reference hereinbefore or hereinafter to the free compounds of 
formula (I), (II) and (DI) or to their respective salts is to be understood as including also the 
corresponding salts or the free compounds of formula (I), (II) and (HI), where appropriate and 
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expedient. The same applies in the case of tautomers of compounds of formula (I), (II) and 
(EI) and the salts thereof. The free form is generally preferred in each case. 

Preferred within the scope of this invention is a process for the preparation of 
compounds of the formula (I), in which 

n is 1; 

m is 1; 

A is a double bond; 

B is single bond or a double bond, 

C is a double bond, 

D is a single bond, 

E is a double bond, 

F is a double bond; or a single bond and a epoxiy bridge; or a single bond and a 

methylene bridge; 
Ri,R 2 and R 3 are H; 
R4 is methyl; 

R 5 is Ci-Cio-alkyl, C 3 -C 8 -cycloalkyl or C 2 -Ci 0 -alkenyl; 
R 6 is H; 
R 7 is OH; 

R 8 and R 9 are independently of each other H; Ci-Cio-alkyl or Ci-C i0 -acyl; or together 
form -(CH 2 ) q -; and 
q is 4, 5 or 6. 

Especially preferred within the scope of this invention is a process for the preparation of 
a compound of the formula (I) in which 
n is 1; 
m is 1; 

A, B, C, E and F are double bonds; 
D is a single bond; 
Ri, R 2 , and R3 are H; 
R 4 is methyl; 

R 5 is s-butyl or isopropyl; 
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R 6 is H; 
R 7 is OH; 
Rg is methyl 
R 9 is H. 

Very especially preferred is a process for the preparation of emamectin, more 
particularly the benzoate salt of emamectin. Emamectin is a mixture of 4"-deoxy-4"-N- 
methylamino avermectin B u /B\ h and is described in US-P-4,4874,749 and as MK-244 in 
Journal of Organic Chemistry, Vol. 59 (1994), 7704-7708. Salts of emamectin that are 
especially valuable agrochemically are described in US-P-5,288,710. Each member of this 
family of peptides exhibiting an enzymatic activity of a P450 monooxygenases as described 
hereinbefore is able to oxidize unprotected avermectin regioselectively at position 4", thus 
opening a new and more economical route for the production of emamectin. 

The family members each catalyze the following reaction: 




/ / 



chemical conversion 
by reductive amination 




a vermectin 4"-keto-a vermectin 

Bla (R=CH 3 ) andBIb (R=H) Bla (R=CH 3 ) andBIb (R=H) 



emamectin 
R=CH 3 ,H 



Accordingly, the invention provides a purified nucleic acid molecule encoding a 
polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and is capable of 
regioselectively oxidizing the alcohol at position 4" of a compound of formular (II) such as 
avermectin in order to produce a compound of formula (DI), but especially 4"-keto- 
avermectin. 

In particular, the invention provides a purified nucleic acid molecule encoding a P450 
monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin. A "nucleic 
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acid molecule" refers to single-stranded or double-stranded polynucleotides, such as 
deoxyribonucleic acid (DNA), ribonucleic acid (RNA), or analogs of either DNA or RNA. 

The invention also provides a purified polypeptide that exhibits an enzymatic activity of 
a P450 monooxygenase and is capable of regioselectively oxidizing the alcohol at position 4" 
of a compound of formular (II) such as avermectin in order to produce a compound of formula 
(III), but especially 4"-keto-avermectin. 

In particular, the invention also provides a purified P450 monooxygenase that 
regioselectively oxidizes avermectin to 4"-keto-avermectin. 

As used herein, by "purified" is meant a nucleic acid molecule or polypeptide (e.g., an 
enzyme or antibody) that has been separated from components which naturally accompany it. 
An example of such a nucleotide sequence or segment of interest "purified" from a source, 
would be nucleotide sequence or segment that is excised or removed from said source by 
chemical means, e.g., by the use of restriction endonucleases, so that it can be further 
manipulated, e.g., amplified, for use in the invention, by the methodology of genetic 
engineering. Such a nucleotide sequence or segment is commonly referred to as 
"recombinant.". In one specific aspect, the purified nucleic acid molecule may be separated 
from nucleotide sequences, such as promoter or enhancer sequences, that flank the nucleic 
acid molecule as it naturally occurs in the chromosome. 

In the case of a protein or a polypeptide, the purified protein and polypeptide, 
respectively, is separated from components, such as other proteins or fragments of cell 
membrane, that accompany it in the cell. Of course, those of ordinary skill in molecular 
biology will understand that water, buffers, and other small molecules may additionally be 
present in a purified nucleic acid molecule or purified protein preparation. A purified nucleic 
acid molecule or purified polypeptide (e.g., enzyme) of the invention that is at least 95% by 
weight, or at least 98% by weight, or at least 99% by weight, or 100% by weight free of 
components which naturally accompany the nucleic acid molecule or polypeptide. 

According to the invention, a purified nucleic acid molecule may be generated, for 
example, by excising the nucleic acid molecule from the chromosome. It may then be ligated 
into an expression plasmid. Other methods for generating a purified nucleic acid molecule 
encoding a P450 monooxygenase of the invention are available and include, without 
limitation, artificial synthesis of the nucleic acid molecule on a nucleic acid synthesizer. 
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Similarly, a purified P450 monooxygenase of the invention may be generated, for 
example, by recombinant expression of a nucleic acid molecule encoding the P450 
monooxygenase in a cell in which the P450 monooxygenase does not naturally occur. Of 
course, other methods for obtaining a purified P450 monooxygenase of the invention include, 
without limitation, artificial synthesis of the P450 monooxygenase on a polypeptide 
synthesizer and isolation of the P450 monooxygenase from a cell in which it naturally occurs 
using, e.g., an antibody that specifically binds the P450 monooxygenase. 

Biotransformations of secondary alcohols to ketones by Streptomyces bacteria are 
known to be catalyzed by dehydrogenases or oxidases. However, prior to the present 
discovery of the cytochrome P450 monooxygenase from Streptomyces tubercidicus strain R- 
922 responsible for the regioselective oxidation of avermectin to 4"-keto-avermectin, no 
experimental data of another cytochrome P450 monooxygenase from Streptomyces to oxidize 
a secondary alcohol to a ketone had been reported. 

According to some embodiments of the invention, the nucleic acid molecule and/or the 
polypeptide encoded by the nucleic acid molecule are isolated from a Streptomyces strain. 
Thus, the nucleic acid molecule (or polypeptide encoded thereby) may be isolated from, 
without limitation, Streptomyces tubercidicus, Streptomyces lydicus, Streptomyces platensis, 
Streptomyces chattanoogensis, Streptomyces kasugaensis, Streptomyces rimosus, and 
Streptomyces albofaciens. 

As mentioned above and described in more detail below, an entire family of 
polypeptides exhibiting an enzymatic activity of P450 monooxygenases capable of 
regioselectively oxidizing avermectin to 4"-keto-avermectin are provided herein. All of these 
family members are related by at least 60% identity at the amino acid level. A useful nucleic 
acid molecule comprising a nucleotide sequence encoding a polypeptide of the invention 
exhibiting an enzymatic activity of a P450 monooxygenase comprises or consists essentially 
of a nucleic acid sequence that is at least 70% identical to SEQ ID NO:l, SEQ ID NO:3, SEQ 
ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, SEQ ID NO:15, 
SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID 
NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID NO:94. In certain 
embodiments, the nucleic acid molecule comprising a nucleotide sequence encoding a 
polypeptide of the invention exhibiting an enzymatic activity of a P450 monooxygenase 
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comprises or consists essentially of a nucleic acid sequence that is at least 80% identical; or at 
least 85% identical; or at least 90% identical; or at least 95% identical; or at least 98% 
identical to SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ 
ID NO-.ll, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO: 19, SEQ ID NO:21, 
SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID 
NO:33, or SEQ ID NO:94. 

Similarly, the invention provides a purified polypeptide exhibiting an enzymatic activity 
of a P450 monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin 
which, in some embodiments, comprises or consists essentially of an amino acid sequence that 
is at least 60% identical to SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ 
ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, 
SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 
NO:32, SEQ ID NO:34, or SEQ ID NO:95. In certain embodiments, the purified polypeptide 
of the invention exhibiting an enzymatic activity of a P450 monooxygenase comprises or 
consists essentially of an amino acid sequence that is at least 70% identical; or at least 80% 
identical; or at least 90% identical; or at least 95% identical to SEQ ID NO:2, SEQ ID NO:4, 
SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID 
NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, 
SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 

In some embodiments, the nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide of the invention exhibiting an enzymatic activity of a P450 
monooxygenase comprises or consists essentially of the nucleic acid sequence of SEQ ID 
NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 11, SEQ ID 
NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID 
NO:94. Similarly, the purified polypeptide of the invention exhibiting an enzymatic activity 
of a P450 monooxygenase, in some embodiments, comprises or consists essentially of the 
amino acid sequence of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID 
NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, 
SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID 
NO:32, SEQ ID NO:34, or SEQ ID NO:95. 
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To describe the sequence relationships between two or more nucleic acids or 
polynucleotides the following terms are used: (a) "reference sequence", (b) "comparison 
window", (c) "sequence identity", (d) "percentage of sequence identity", and (e) "substantial 
identity". 

(a) As used herein, "reference sequence" is a defined sequence used as a basis for 
sequence comparison. A reference sequence may be a subset or the entirety of a specified 
sequence; for example, as a segment of a full length cDNA or gene sequence, or the complete 
cDNA or gene sequence. 

(b) As used herein, "comparison window" makes reference to a contiguous and 
specified segment of a polynucleotide sequence, wherein the polynucleotide sequence in the 
comparison window may comprise additions or deletions (i.e., gaps) compared to the 
reference sequence (which does not comprise additions or deletions) for optimal alignment of 
the two sequences. Generally, the comparison window is at least 20 contiguous nucleotides in 
length, and optionally can be 30, 40, 50, 100, or longer. Those of skill in the art understand 
that to avoid a high similarity to a reference sequence due to inclusion of gaps in the 
polynucleotide sequence a gap penalty is typically introduced and is subtracted from the 
number of matches. 

Methods of alignment of sequences for comparison are well known in the art. Thus, 
the determination of percent identity between any two sequences can be accomplished using a 
mathematical algorithm. Preferred, non-limiting examples of such mathematical algorithms 
are the algorithm of Myers and Miller, 1988; the local homology algorithm of Smith et al. 
1981; the homology alignment algorithm of Needleman and Wunsch 1970; the search-for- 
similarity-method of Pearson and Lipman 1988; the algorithm of Karlin and Altschul, 1990, 
modified as in Karlin and Altschul, 1993. 

Computer implementations of these mathematical algorithms can be utilized for 
comparison of sequences to determine sequence identity. Such implementations include, but 
are not limited to: CLUSTAL in the PC/Gene program (available from Intelligenetics, 
Mountain View, California); the ALIGN program (Version 2.0) and GAP, BESTFTT, BLAST, 
FASTA, and TFASTA in the Wisconsin Genetics Software Package, Version 8 (available 
from Genetics Computer Group (GCG), 575 Science Drive, Madison, Wisconsin, USA). 
Alignments using these programs can be performed using the default parameters. The 
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CLUSTAL program is well described by Higgins et al. 1988; Higgins et al. 1989; Corpet et al. 
1988; Huang et al. 1992; and Pearson et al. 1994. The ALIGN program is based on the 
algorithm of Myers and Miller, supra. The BLAST programs of Altschul et aL, 1990, are 
based on the algorithm of Karlin and Altschul supra. 

Software for performing BLAST analyses is publicly available through the National 
Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm 
involves first identifying high scoring sequence pairs (HSPs) by identifying short words of 
length W in the query sequence, which either match or satisfy some positive-valued threshold 
score T when aligned with a word of the same length in a database sequence. T is referred to 
as the neighborhood word score threshold (Altschul et al., 1990). These initial neighborhood 
word hits act as seeds for initiating searches to find longer HSPs containing them. The word 
hits are then extended in both directions along each sequence for as far as the cumulative 
alignment score can be increased. Cumulative scores are calculated using, for nucleotide 
sequences, the parameters M (reward score for a pair of matching residues; always > 0) and N 
(penalty score for mismatching residues; always < 0). For amino acid sequences, a scoring 
matrix is used to calculate the cumulative score. Extension of the word hits in each direction 
are halted when the cumulative alignment score falls off by the quantity X from its maximum 
achieved value, the cumulative score goes to zero or below due to the accumulation of one or 
more negative-scoring residue alignments, or the end of either sequence is reached. 

In addition to calculating percent sequence identity, the BLAST algorithm also performs 
a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul 
(1993). One measure of similarity provided by the BLAST algorithm is the smallest sum 
probability (P(N)), which provides an indication of the probability by which a match between 
two nucleotide or amino acid sequences would occur by chance. For example, a test nucleic 
acid sequence is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid sequence to the reference nucleic acid sequence is less 
than about 0.1, more preferably less than about 0.01, and most preferably less than about 
0.001. 

To obtain gapped alignments for comparison purposes, Gapped BLAST (in BLAST 
2,0) can be utilized as described in Altschul et al. 1997. Alternatively, PSLBLAST (in 
BLAST 2.0) can be used to perform an iterated search that detects distant relationships 
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between molecules. See Altschul et al., supra. When utilizing BLAST, Gapped BLAST, PSI- 
BLAST, the default parameters of the respective programs (e.g. BLASTN for nucleotide 
sequences, BLASTX for proteins) can be used. The BLASTN program (for nucleotide 
sequences) uses as defaults a wordlength (W) of 11, an expectation (E) of 10, a cutoff of 100, 
M=5, N=-4, and a comparison of both strands. For amino acid sequences, the BLASTP 
program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the 
BLOSUM62 scoring matrix (see Henikoff & Henikoff, 1989). See 

http://www.ncbi.nlm.nih.gov. Alignment may also be performed manually by inspection. 
For purposes of the present invention, comparison of nucleotide sequences for determination 
of percent sequence identity to the nucleotide sequences disclosed herein is preferably made 
using the BlastN program (version 1.4.7 or later) with its default parameters or any equivalent 
program. By "equivalent program" is intended any sequence comparison program that, for 
any two sequences in question, generates an alignment having identical nucleotide or amino 
acid residue matches and an identical percent sequence identity when compared to the 
corresponding alignment generated by the preferred program. 

(c) As used herein, "sequence identity" or "identity" in the context of two nucleic acid 
or polypeptide sequences makes reference to the residues in the two sequences that are the 
same when aligned for maximum correspondence over a specified comparison window. 
When percentage of sequence identity is used in reference to proteins it is recognized that 
residue positions which are not identical often differ by conservative amino acid substitutions, 
where amino acid residues are substituted for other amino acid residues with similar chemical 
properties (e.g., charge or hydrophobic! ty) and therefore do not change the functional 
properties of the molecule. When sequences differ in conservative substitutions, the percent 
sequence identity may be adjusted upwards to correct for the conservative nature of the 
substitution. Sequences that differ by such conservative substitutions are said to have 
"sequence similarity" or "similarity." Means for making this adjustment are well known to 
those of skill in the art. Typically this involves scoring a conservative substitution as a partial 
rather than a full mismatch, thereby increasing the percentage sequence identity. Thus, for 
example, where an identical amino acid is given a score of 1 and a non-conservative 
substitution is given a score of zero, a conservative substitution is given a score between zero 
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and 1. The scoring of conservative substitutions is calculated, e.g., as implemented in the 
program PC/GENE (Intelligenetics, Mountain View, California). 

(d) As used herein, "percentage of sequence identity" means the value determined by 
comparing two optimally aligned sequences over a comparison window, wherein the portion 
of the polynucleotide sequence in the comparison window may comprise additions or 
deletions (i.e., gaps) as compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two sequences. The percentage is 
calculated by determining the number of positions at which the identical nucleic acid base or 
amino acid residue occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the window of 
comparison, and multiplying the result by 100 to yield the percentage of sequence identity. 
(e)(i) The term "substantial identity" of polynucleotide sequences means that a polynucleotide 
comprises a sequence that has at least 66%. 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 
75%, 76%, 77%, 78%, or 79%, preferably at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 
87%, 88%, or 89%, more preferably at least 90%, 91%, 92%, 93%, or 94%, and most 
preferably at least 95%, 96%, 97%, 98%, or 99% sequence identity, compared to a reference 
sequence using one of the alignment programs described using standard parameters. One of 
skill in the art will recognize that these values can be appropriately adjusted to determine 
corresponding identity of proteins encoded by two nucleotide sequences by taking into 
account codon degeneracy, amino acid similarity, reading frame positioning, and the like. 
Substantial identity of amino acid sequences for these purposes normally means sequence 
identity of at least 70%, more preferably at least 80%, 90%, and most preferably at least 95%. 

Another indication that nucleotide sequences are substantially identical is if two 
molecules hybridize to each other under stringent conditions (see below). Generally, stringent 
conditions are selected to be about 5°C lower than the thermal melting point (T m ) for the 
specific sequence at a defined ionic strength and pH. However, stringent conditions 
encompass temperatures in the range of about 1°C to about 20°C, depending upon the desired 
degree of stringency as otherwise qualified herein. Nucleic acids that do not hybridize to each 
other under stringent conditions are still substantially identical if the polypeptides they encode 
are substantially identical. This may occur, e.g., when a copy of a nucleic acid is created 
using the maximum codon degeneracy permitted by the genetic code. One indication that two 
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nucleic acid sequences are substantially identical is when the polypeptide encoded by the first 
nucleic acid is immunologically cross reactive with the polypeptide encoded by the second 
nucleic acid. 

(e)(ii) The term "substantial identity" in the context of a polypeptide indicates that a 
polypeptide comprises a sequence with at least 50%, 60%, 70%, 71%, 72%, 73%, 74%, 75%, 
76%, 77%, 78%, or 79%, preferably 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, or 
89%, more preferably at least 90%, 91%, 92%, 93%, or 94%, or even more preferably, 95%, 
96%, 97%, 98% or 99%, sequence identity to the reference sequence over a specified 
comparison window. Preferably, optimal alignment is conducted using the homology 
alignment algorithm of Needleman and Wunsch (1970). An indication that two polypeptide 
sequences are substantially identical is that one polypeptide is immunologically reactive with 
antibodies raised against the second polypeptide. Thus, a polypeptide is substantially identical 
to a second polypeptide, for example, where the two peptides differ only by a conservative 
substitution. 

For sequence comparison, typically one sequence acts as a reference sequence to which 
test sequences are compared. When using a sequence comparison algorithm, test and 
reference sequences are input into a computer, subsequence coordinates are designated if 
necessary, and sequence algorithm program parameters are designated. The sequence 
comparison algorithm then calculates the percent sequence identity for the test sequence(s) 
relative to the reference sequence, based on the designated program parameters. 

As noted above, another indication that two nucleic acid sequences are substantially 
identical is that the two molecules hybridize to each other under stringent conditions. The 
phrase "hybridizing specifically to" refers to the binding, duplexing, or hybridizing of a 
molecule only to a particular nucleotide sequence under stringent conditions when that 
sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) 
substantially" refers to complementary hybridization between a probe nucleic acid and a target 
nucleic acid and embraces minor mismatches that can be accommodated by reducing the 
stringency of the hybridization media to achieve the desired detection of the target nucleic 
acid sequence. 

"Stringent hybridization conditions" and "stringent hybridization wash conditions" in 
the context of nucleic acid hybridization experiments such as Southern and Northern 
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hybridization are sequence dependent, and are different under different environmental 
parameters. The T m is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. Specificity is typically the 
function of post-hybridization washes, the critical factors being the ionic strength and 
temperature of the final wash solution. For DNA-DNA hybrids, the T m can be approximated 
from the equation of Meinkoth and Wahl, 1984; T m 81.5°C + 16.6 (log M) +0.41 (%GC) - 

0. 61 (% form) - 500/L; where M is the molarity of monovalent cations, %GC is the 
percentage of guanosine and cytosine nucleotides in the DNA, % form is the percentage of 
formamide in the hybridization solution, and L is the length of the hybrid in base pairs. T m is 
reduced by about 1°C for each 1% of mismatching; thus, T m , hybridization, and/or wash 
conditions can be adjusted to hybridize to sequences of the desired identity. For example, if 
sequences with >90% identity are sought, the T m can be decreased 10°C. Generally, stringent 
conditions are selected to be about 5°C lower than the thermal melting point I for the specific 
sequence and its complement at a defined ionic strength and pH. However, severely stringent 
conditions can utilize a hybridization and/or wash at 1, 2, 3, or 4°C lower than the thermal 
melting point I; moderately stringent conditions can utilize a hybridization and/or wash at 6, 7, 
8, 9, or 10°C lower than the thermal melting point I; low stringency conditions can utilize a 
hybridization and/or wash at 11, 12, 13, 14, 15, or 20°C lower than the thermal melting point 

1. Using the equation, hybridization and wash compositions, and desired T, those of ordinary 
skill will understand that variations in the stringency of hybridization and/or wash solutions 
are inherently described. If the desired degree of mismatching results in a T of less than 45°C 
(aqueous solution) or 32°C (formamide solution), it is preferred to increase the SSC 
concentration so that a higher temperature can be used. An extensive guide to the 
hybridization of nucleic acids is found in Tijssen, 1993. Generally, highly stringent 
hybridization and wash conditions are selected to be about 5°C lower than the thermal melting 
point T m for the specific sequence at a defined ionic strength and pH. 

An example of highly stringent wash conditions is 0.15 M NaCl at 72°C for about 15 
minutes. An example of stringent wash conditions is a 0.2X SSC wash at 65°C for 15 minutes 
(see, Sambrook, infra, for a description of SSC buffer). Often, a high stringency wash is 
preceded by a low stringency wash to remove background probe signal. An example medium 
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stringency wash for a duplex of, e.g., more than 100 nucleotides, is IX SSC at 45°C for 15 
minutes. An example low stringency wash for a duplex of, e.g., more than 100 nucleotides, is 
4-6X SSC at 40°C for 15 minutes. For short probes (e.g., about 10 to 50 nucleotides), 
stringent conditions typically involve salt concentrations of less than about 1 .5 M, more 
preferably about 0.01 to 1.0 M, Na ion concentration (or other salts) at pH 7.0 to 8.3, and the 
temperature is typically at least about 30°C and at least about 60°C for long robes (e.g., >50 
nucleotides). Stringent conditions may also be achieved with the addition of destabilizing 
agents such as formamide. In general, a signal to noise ratio of 2X (or higher) than that 
observed for an unrelated probe in the particular hybridization assay indicates detection of a 
specific hybridization. Nucleic acids that do not hybridize to each other under stringent 
conditions are still substantially identical if the proteins that they encode are substantially 
identical. This occurs, e.g., when a copy of a nucleic acid is created using the maximum 
codon degeneracy permitted by the genetic code. 

Very stringent conditions are selected to be equal to the T m for a particular probe. An 
example of stringent conditions for hybridization of complementary nucleic acids which have 
more than 100 complementary residues on a filter in a Southern or Northern blot is 50% 
formamide, e.g., hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37°C, and a wash in 
0. IX SSC at 60 to 65°C. Exemplary low stringency conditions include hybridization with a 
buffer solution of 30 to 35% formamide, 1 M NaCl, 1% SDS (sodium dodecyl sulphate) at 
37°C, and a wash in IX to 2X SSC (20X SSC = 3.0 M NaCl/0.3 M trisodium citrate) at 50 to 
55°C. Exemplary moderate stringency conditions include hybridization in 40 to 45% 
formamide, 1.0 M NaCl, 1% SDS at 37°C, and a wash in 0.5X to IX SSC at 55 to 60°C. 

The following are examples of sets of hybridization/wash conditions that may be used to 
clone orthologous nucleotide sequences that are substantially identical to reference nucleotide 
sequences of the present invention: a reference nucleotide sequence preferably hybridizes to 
the reference nucleotide sequence in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM 
EDTA at 50°C with washing in 2X SSC, 0.1% SDS at 50°C, more desirably in 7% sodium 
dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in IX SSC, 0.1% 
SDS at 50°C, more desirably still in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM 
EDTA at 50°C with washing in 0.5X SSC, 0.1% SDS at 50°C, preferably in 7% sodium 
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dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM EDTA at 50°C with washing in 0.1X SSC, 0.1% 
SDS at 50°C, more preferably in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP0 4 , 1 mM 
EDTA at 50°C with washing in 0.1X SSC, 0.1% SDS at 65°C. 

One non-limiting source of a purified polypeptide of the invention exhibiting an 
enzymatic activity of a P450 monooxygenase that regioselectively oxidizes avermectin to 4"- 
keto-avermectin is the cell-free extract described in the examples below. Another method for 
purifying a polypeptide exhibiting a P450 monooxygenase activity in accordance with the 
invention is to attach a tag to the protein, thereby facilitating its purification. Accordingly, the 
invention provides a purified polypeptide exhibiting an enzymatic activity of a P450 
monooxygenase which regioselectively oxidizes avermectin to 4"-keto-avermectin, wherein 
the polypeptide is covalently bound to a tag. The invention further provides a nucleic acid 
molecule encoding such a tagged polypeptide. 

As used herein, a "tag" is meant a polypeptide or other molecule covalently bound to a 
polypeptide of the invention, whereby a binding agent (e.g., a polypeptide or molecule) 
specifically binds the tag. In accordance with the invention, by "specifically binds" is meant 
that the binding agent (e.g., an antibody or Ni 2+ resin) recognizes and binds to a particular 
polypeptide or chemical but does not substantially recognize or bind to other molecules in the 
sample. In some embodiments, a binding agent that specifically binds a ligand forms an 
association with that ligand with an affinity of at least 10 6 M"\ or at least 10 7 M"\ or at least 
10 8 M"\ or at least 10 9 M _1 either in water, under physiological conditions, or under conditions 
which approximate physiological conditions with respect to ionic strength, e.g., 140 mM 
NaCl, 5 mM MgCl 2 . For example, a His tag is specifically bound by nickel (e.g., the Ni 2+ - 
charged column commercially available as HisrBind® Resin from Novagen Inc, Madison, 
WI). Likewise, a Myc tag is specifically bound by an antibody that specifically binds Myc. 

As described below, a His tag is attached to the purified polypeptide of the invention 
exhibiting an enzymatic activity of a P450 monooxygenase by generating a nucleic acid 
molecule encoding the His-tagged polypeptide, and expressing the polypeptide in E. coli. 
These polypeptides, once expressed by E. coli, are readily purified by standard techniques 
(e.g., using one of the His»Bind® Kits commercially available from Novagen or using the 
TALON™ Resin (and manufacturer's instructions) commercially available from Clontech 
Laboratories, Inc., Palo Alto, CA). 
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Additional tags may be attached to any or all of the polypeptides of the invention to 
facilitate purification. These tags include, without limitation, the HA-Tag (amino acid 
sequence: YPYDVPDYA (SEQ ID NO:39)), the Myc-tag (amino acid sequence: 
EQKLISEEDL (SEQ ID NO:40)), the HSV tag (amino acid sequence: QPELAPEDPED (SEQ 
ID NO:41)), and the VSV-G-Tag (amino acid sequence: YTDEMNRLGK (SEQ ID NO:42)). 
Covalent attachment (e.g., via a polypeptide bond) of these tags to a polypeptide of the 
invention allows purification of the tagged polypeptide using, respectively, an anti-HA 
antibody, an anti-Myc antibody, an anti-HSV antibody, or an anti-VSV-G antibody, all of 
which are commercially available (for example, from MBL International Corp., Watertown, 
MA; Novagen Inc.; Research Diagnostics Inc., Flanders, NJ). 

The tagged polypeptides of the invention exhibiting a P450 monooxygenase activity 
may also be tagged by a covalent bond to a chemical, such as biotin, which is specifically 
bound by streptavidin, and thus may be purified on a streptavidin column. Similarly, the 
tagged P450 monooxygenases of the invention may be covalently bound (e.g., via a 
polypeptide bond) to the constant region of an antibody. Such a tagged P450 monooxygenase 
may be purified, for example, on protein A sepharose. 

The tagged P450 monooxygenases of the invention may also be tagged to a GST 
(glutathione-S-transferase) or the constant region of an immunoglobulin. For example, a 
nucleic acid molecule of the invention (e.g., comprising SEQ ID NO:l) can be cloned into one 
of the pGEX plasmids commercially available from Amersham Pharmacia Biotech, Inc. 
(Piscataway NJ), and the plasmid expressed in E. coli. The resulting P450 monooxygenase 
encoded by the nucleic acid molecule is covalently bound to a GST (glutathione-S- 
transferase). These GST fusion proteins can be purified on a glutathione agarose column 
(commercially available from, e.g., Amersham Pharmacia Biotech), and thus purified. Many 
of the pGEX plasmids enable easy removal of the GST portion from the fusion protein. For 
example, the pGEX-2T plasmid contains a thrombin recognition site between the inserted 
nucleic acid molecule of interest and the GST-encoding nucleic acid sequence. Similarly, the 
pGES-3T plasmid contains a factor Xa site. By treating the fusion protein with the 
appropriate enzyme, and then separating the GST portion from the P450 monooxygenase of 
the invention using glutathione agarose (to which the GST specifically binds), the P450 
monooxygenase of the invention can be purified. 
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Yet another method to obtain a purified polypeptide of the invention exhibiting a P450 
monooxygenase activity is to use a binding agent that specifically binds to such a polypeptide. 
Accordingly, the invention provides a binding agent that specifically binds to a P450 
monooxygenase of the invention. This binding agent of the invention may be a chemical 
compound (e.g., a protein), a metal ion, or a small molecule. 

In particular embodiments, the binding agent is an antibody. The term "antibody" 
encompasses, without limitation, polyclonal antibody, monoclonal antibody, antibody 
fragments (e.g., Fab, Fv, or Fab' fragments), single chain antibody, chimeric antibody, bi- 
specific antibody, antibody of any isotype (e.g., IgG, IgA, and IgE), and antibody from any 
specifies (e.g., rabbit, mouse, and human). 

In one non-limiting example, the binding agent of the invention is a polyclonal antibody. 
In another non-limiting example, the binding agent of the invention is a monoclonal antibody. 
Methods for making both monoclonal and polyclonal antibodies are well known (see, e.g., 
Current Protocols in Immunology , ed. John E. Coligan, John Wiley & Sons, Inc. 1993; 
Current Protocols in Molecular Biology , eds. Ausubel et ah, John Wiley & Sons, Inc. 2000). 

The polypeptides described herein exhibiting an enzymatic activity of a P450 
monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin belong to a 
family of novel P450 monooxygenases. Accordingly, the invention also provides a family of 
P450 monooxygenase polypeptides, wherein each member of the family regioselectively 
oxidizes avermectin to 4"-keto-avermectin. In some embodiments, each member of the 
family comprises or consists of an amino acid sequence that is at least 50% identical to SEQ 
ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ 
ID NO:14, SEQ ID NO:16, SEQ ID NO:18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, 
SEQ ID NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID 
NO:95. In particular embodiments, each member of the family is encoded by a nucleic acid 
molecule comprising or consisting of a nucleic acid sequence that is at least 66% identical to 
SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:ll, 
SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID 
NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or 
SEQ ID NO:94. 
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The present invention, which provides an entire family of P450 monooxygenases, each 
member of which is able to regioselectively oxidize avermectin to 4"-keto-avermectin, 
allowed for the generation of an improved P450 monooxygenase, which may not be naturally 
occurring, but which regioselectively oxidizes avermectin to 4"-keto-avermectin with 
efficiency and with reduced undesirable side product. For instance, one of the members of the 
P450 monooxygenase family of the invention, P450 Em ai enzyme catalyzes a further oxidation 
that is not desirable, since the formation of 3"-0-demethyl-4"-keto-avermectin has been 
detected in the reaction by Streptomyces tubercidicus strain R-922 and by Streptomyces 
lividans containing the emal gene. The formation of 3"-0-demethyl-4"-keto-avermectin is 
brought about by the oxidation of the 3"-0-methyl group, whereby the hydrolytically labile 
3"-0-hydroxymethyl group is formed which hydrolyzes to form formaldehyde and the 3"- 
hydroxyl group. 

By providing a family of polypeptides exhibiting an enzymatic activitiy of P450 
monooxygenases that regioselectively oxidize avermectin to 4"-keto-avermectin (see, e.g., 
Table 3 below), individual members of the family can be subjected to family gene shuffling 
efforts in order to produce new hybrid genes encoding optimized P450 monooxygenases of 
the invention. In one non-limiting example, a portion of the emal gene encoding the 0 2 
binding site of the P450 En iai protein can be swapped with the portion of the emal gene 
encoding the 0 2 binding site of the P450 Em a2 protein. Such a chimeric em a 1/2 protein is 
within definition of a P450 monooxygenase of the invention. 

Site-directed mutagenesis or directed evolution technologies may also be employed to 
generate derivatives of the emal gene that encode enzymes with improved properties, 
including higher overall activity and/or reduced side product formation. One method for 
deriving such a mutant is to mutate the Streptomyces strain itself, in a manner similar to the 
UV mutation of Streptomyces tubercidicus strain R-922 described below. 

Additional derivatives may be made by making conservative or non-conservative 
changes to the amino acid sequence of a P450 monooxygenase. Conservative and non- 
conservative amino acid substitutions are well known (see, e.g., Stryer, Biochemistry ^ 3 rd Ed., 
W.H. Freeman and Co., NY 1988). Similarly, truncations of a P450 monooxygenase of the 
invention may be generated by truncating the protein at its N-terminus (e.g., see the emal A 
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gene described below), at its C-terminus, or truncating {i.e., removing amino acid residues) 
from the middle of the protein. 

Such a mutant, derivative, or truncated P450 monooxygenase is a P450 monooxygenase 
of the invention as long as the mutant, derivative, or truncated P450 monooxygenase is able to 
regioselectively oxidize avermectin to 4"-keto-avermectin. 

In another aspect, the invention provides a cell genetically engineered to comprise a 
nucleic acid molecule encoding a polypeptide which exhibits an enzymatic activity of a P450 
monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin. By 
"genetically engineered" is meant that the nucleic acid molecule is exogenous to the cell into 
which it is introduced. Introduction of the exogenous nucleic acid molecule into the 
genetically engineered cell may be accomplished by any means, including, without limitation, 
transfection, transduction, and transformation. 

In certain embodiments, the nucleic acid molecule is positioned for expression in the 
genetically engineered cell. By "positioned for expression" is meant that the exogenous 
nucleic acid molecule encoding the polypeptide is linked to a regulatory sequence in such a 
way as to permit expression of the nucleic acid molecule when introduced into a cell. By 
"regulatory sequence" is meant nucleic acid sequences, such as initiation signals, 
polyadenylation (polyA) signals, promoters, and enhancers, which control expression of 
protein coding sequences with which they are operably linked. By "expression" of a nucleic 
acid molecule encoding a protein or polypeptide fragment is meant expression of that nucleic 
acid molecule as protein and/or mRNA. 

A genetically engineered cell of the invention may be a prokayotic cell (e.g., E. coli) or a 
eukaryotic cell (e.g., Saccharomyces cerevisiae or mammalian cell (e.g., HeLa)). According 
to some embodiments of the invention, the genetically engineered cell is a cell wherein the 
wild-type (i.e., not genetically engineered) cell does not naturally contain the inserted nucleic 
acid molecule and does not naturally express the protein encoded by the inserted nucleic acid 
molecule. Accordingly, the cell may be a genetically engineered Streptomyces strain, such as 
a Streptomyces lividans or a Streptomyces avermitilis strain. Alternatively, the cell may be a 
genetically engineered Pseudomonas strain, such as a Pseudomonas putida strain or a 
Pseudomonas fluorescens strain. In another alternative, the cell may be a genetically 
engineered Escherichia coli strain. 
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Note that in some types of cells genetically engineered to comprise a nucleic acid 
molecule encoding a polypeptide which exhibits an enzymatic activity of a P450 
monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin, the actual 
genetically engineered cell, itself, may not be able to convert avermectin into 4"-keto- 
avermectin. Rather, the P450 monooxygenase heterogously expressed by such a genetically 
engineered cell may be purified from that cell, where the purified P450 monooxygenase of the 
invention can be used to regioselectively oxidize avermectin to 4"-keto-avermectin. Thus, the 
genetically engineered cell of the invention need not, itself, be able to regioselectively convert 
avermectin to 4"-keto-avermectin; rather, the genetically engineered cell of the invention need 
only comprise a nucleic acid molecule encoding a polypeptide which exhibits an enzymatic 
activity of a P450 monooxygenase that regioselectively oxidizes avermectin to 4"-keto- 
avermectin, regardless of whether the polypeptide is active inside that cell. 

In addition, a cell (e.g., E. coli) geneticially engineered to comprise a nucleic acid 
molecule encoding a polypeptide of the invention which exhibits an enzymatic activity of a 
P450 monooxygenase may not be able to regioselectively oxidize avermectin to 4"-keto- 
avermectin, although the P450 monooxygenase purified from the genetically engineered cell is 
able to regioselectively oxidize avermectin to 4"-keto-avermectin. However, if the same cell 
were genetically engineered to comprise a polypeptide of the invention which exhibits an 
enzymatic activity of a P450 monooxygenase, a ferredoxin of the invention, and/or a 
ferredoxin reductase of the invention, then the P450 monooxygenase together with the 
ferredoxin and the ferredoxin reductase, all purified from that cell, and in the presence of a 
reducing agent (e.g., NADH or NADPH), would be able to regioselectively oxidize 
avermectin to 4"-keto-avermectin. Furthermore the genetically engineered cell comprising a 
P450 monooxygenase of the invention, a ferredoxin of the invention, and a ferredoxin 
reductase of the invention, itself 7 might be able to carry out this oxidation. 

Moreover, in a non-limiting example where a cell (e.g., E. coli) is genetically engineered 
to express P450 monooxygenase, a ferredoxin, and a ferredoxin reductase proteins of the 
invention, all three of these proteins, when purified from the genetically engineered E. coli, 
are together and in the presence of a reducing agent (e.g., NADH or NADPH) would be active 
and able to regioselectively oxidize avermectin to 4"-keto-avermectin, and so are useful in a 
method for making emamectin. 
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In accordance with the present invention, the following material has been deposited with 
the Agricultural Research Service, Patent Culture Collection (NRRL), 1815 North University 
Street, Peoria, Illinois 61604, under the terms of the Budapest Treaty on the International 
Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure: (1) 
Streptomyces lividans ZX7 (emal/fd233-TUAlA) NRRL Designation No. B-30478; and (2) 
Pseudomonas putida NRRL B-4067 containing plasmid pRK290-emal/fd233, NRRL 
Designation No.B-30479 

In identifying the novel family of polypeptides exhibiting an enzymatic activity of P450 
monooxygenases that regioselectively oxidize avermectin to 4"-keto-avermectin, novel 
ferredoxins and novel ferredoxin reductases were also identified in the same strains of bacteria 
in which the P450 monooxygenases were found. Accordingly, in a further aspect, the 
invention provides a purified nucleic acid molecule comprising a nucleotide sequence 
encoding a polypeptide that exhibits an enzymatic activity of a ferredoxin, wherein the nucleic 
acid molecule is isolated from a Streptomyces strain comprising a polypeptide that 
regioselectively oxidizes avermectin to 4"-keto-avermectin. Similarly, the invention provides 
a purified nucleic acid molecule comprising a nucleotide sequence encoding a polypeptide 
that exhibits an enzymatic activity of a ferredoxin reductase, wherein the nucleic acid 
molecule is isolated from a Streptomyces strain comprising a polypeptide that regioselectively 
oxidizes avermectin to 4' 5 -keto- a vermec ti n . The invention also provides a purified protein 
that exhibits an enzymatic activity of a ferredoxin, as well as a purified protein that exhibits 
an enzymatic activity of a ferredoxin reductase, wherein the ferredoxin protein and the 
ferredoxin reductase protein are isolated from a Streptomyces strain comprising a polypeptide 
that regioselectively oxidizes avermectin to 4"-keto- avermectin. 

A useful nucleic acid molecule comprising a nucleotide sequence encoding a 
polypeptide that exhibits an enzymatic activity of a ferredoxin comprises or consists 
essentially of a nucleic acid sequence that is at least 81% identical to SEQ ID NO:35 or SEQ 
ID NO:37. Alternatively, the nucleic acid molecule comprises or consists essentially of a 
nucleic acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% 
identical to SEQ ID NO:35 or SEQ ID NO:37. The nucleic acid molecule comprising a 
nucleotide sequence encoding a polypeptide that exhibits an enzymatic activity of a ferredoxin 
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may comprise or consist essentially of the nucleic acid sequence of SEQ ID NO:35 or SEQ ID 
NO:37. 

The protein of the invention exhibiting a ferredoxin activity may comprise or consist 
essentially of an amino acid sequence that is at least 80% identical to SEQ ID NO:36 or SEQ 
ID NO:38. In some embodiments, the nucleic acid molecule comprises or consists essentially 
an amino acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% 
identical to SEQ ID NO:36 or SEQ ID NO:38. The ferredoxin of the invention may comprise 
or consist essentially of the amino acid sequence of SEQ ID NO:36 or SEQ ID NO:38. 

A useful nucleic acid molecule comprising a nucleotide sequence encoding a protein of 
the invention exhibiting a ferredoxin reductase comprises or consists essentially of the nucleic 
acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% identical to 
SEQ ID NO:98, SEQ ID NO: 100, SEQ ID NO: 102, or SEQ ID NO: 104. In a particular 
embodiment of the invention, the nucleic acid molecule encoding a ferredoxin reductase of 
the invention may comprise or consist essentially of the amino acid sequence of SEQ ID 
NO:98, SEQ ID NO: 100, SEQ ID NO: 102, or SEQ ID NO: 104. 

The ferredoxin reductase of the invention may comprise or consist essentially of the 
amino acid sequence that is at least 85%, or at least 90%, or at least 95%, or at least 99% 
identical to SEQ ID NO:99, SEQ ID NO:101, SEQ ID NO:103, or SEQ ID NO:105. In a 
particular embodiment of the invention, the ferredoxin reductase of the invention may 
comprise or consist essentially of the amino acid sequence of SEQ ID NO:99, SEQ ID 
NO: 101, SEQ ID NO: 103, or SEQ ID NO: 105. 

Methods for purifying ferredoxin and ferredoxin reductase proteins and nucleic acid 
molecules encoding such ferredoxin and ferredoxin reductase proteins are known in the art 
and are the same as those described above for purifying P450 monooxygenases of the 
invention and nucleic acid molecules encoding P450 monooxygenases of the invention. 

In one non-limiting example to obtain a purified P450 monooxygenase of the invention 
with a purified ferredoxin, a S. lividans strain (or P. putida strain, or any other cell in which 
the P450 monooxygenase of the invention does not naturally occur) may be genetically 
engineered to contain a first nucleic acid molecule encoding a P450 monooxygenase of the 
invention and a second nucleic acid molecule encoding a ferredoxin protein, where both the 
first and second nucleic acid molecules are positioned for expression in the genetically 
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engineered cell. The first and the second nucleic acid molecules can be on separate plasmids, 
or can be on the same plasmid. Thus, the same engineered cell or strain will produce both the 
P450 monooxygenase of the invention and the ferredoxin protein of the invention. 

In a further non-limiting example to obtain a purified P450 monooxygenase of the 
invention with a purified ferredoxin and with a purified ferredoxin reductase of the invention, 
a S. lividans strain (or P. putida strain, or any other cell in which the P450 monooxygenase of 
the invention does not naturally occur) may be genetically engineered to contain a first nucleic 
acid molecule encoding a P450 monooxygenase of the invention and a second nucleic acid 
molecule encoding a ferredoxin protein of the invention and a third nucleic acid molecule 
encoding a ferredoxin reductase protein of the invention, where all the first and second and 
third nucleic acid molecules are positioned for expression in the genetically engineered cell. 
The first and the second and the third nucleic acid molecules may be provided on separate 
plasmids, or on the same plasmid. Thus, the same engineered cell or strain will produce all 
the P450 monooxygenase of the invention and the ferredoxin and the ferredoxin reductase 
proteins of the invention. 

As described above for the P450 monooxygenases of the invention, the ferredoxin 
protein and/or the ferredoxin reductase protein may further comprise a tag. Moreover, the 
invention contemplates binding agents {e.g., antibodies) that specifically bind to the 
ferredoxin protein, and binding agents that specifically bind to the ferredoxin reductase 
proteins of the invention. Methods for generating tagged ferredoxin protein, tagged 
ferredoxin reductase protein, and binding agents (e.g., antibodies) that specifically bind to 
ferredoxin or ferredoxin reductase are the same as those as described above for generating 
tagged P450 monooxygenases of the invention and generating binding agents that specifically 
bind P450 monooxygenases of the invention. 

The invention also provides a method for making emamectin. In this method, a P450 
monooxygenase that regioselectively oxidizes avermectin to 4"-keto-aveimectin is added to a 
reaction mixture containing avermectin. The reaction mixture is then incubated under 
conditions that allow the P450 monooxygenase to regioselectively oxidize avermectin to 4"- 
keto-avermectin. The reaction mixture may further comprise a ferredoxin, such as a 
ferredoxin of the present invention. In particular embodiments, the reaction mixture further 
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comprises a ferredoxin reductase such as a ferredoxin of the present invention. The reaction 
mixture may further comprise a reducing agent, such as NADH or NADPEL 

Additionally, the invention provides a method for making 4"-keto-avermectin. The 
method comprises adding a P450 monooxygenase that regioselectively oxidizes avermectin to 
4"-keto-avermectin to a reaction mixture comprising avermectin and incubating the reaction 
mixture under conditions that allow the P450 monooxygenase to regioselectively oxidize 
avermectin to 4"-keto-avermectin. In some embodiments, the reaction mixture further 
comprises a ferredoxin, such as a ferredoxin of the present invention. The reaction mixture 
may also further comprise a ferredoxin reductase such as a ferredoxin of the present invention. 
In particular embodiments, the reaction mixture further comprises a reducing agent, such as 
NADHorNADPH. 

The invention also provides a formulation for making emamectin comprising a P450 
monooxygenase that regioselectively oxidizes avermectin to 4"-keto-avermectin. In some 
embodiments, the formulation further comprises a ferredoxin, such as a ferredoxin of the 
present invention. In particular embodiments, the ferredoxin is isolated from the same species 
of cell or strain from which the P450 monooxygenase was isolated or derived. The 
formulation may further comprise a ferredoxin reductase , such as a ferredoxin reductase of 
the present invention. In particular embodiments, the ferredoxin reductase is isolated from the 
same species of cell or strain from which the P450 monooxygenase was isolated or derived. . 
In some embodiments, the formulation further comprises a reducing agent, such as NADH or 
NADPH. 

In addition, the invention provides a formulation for making 4"-keto-avermectin 
comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4"-keto- 
avermectin. In some embodiments, the formulation further comprises a ferredoxin, such as a 
ferredoxin of the present invention. In particular embodiments, the ferredoxin is isolated from 
the same species of cell or strain from which the P450 monooxygenase was isolated or 
derived. In some embodiments, the formulation further comprises a ferredoxin reductase, 
such as a ferredoxin reductase of the present invention. In particular embodiments, the 
ferredoxin reductase is isolated from the same species of cell or strain from which the P450 
monooxygenase was isolated or derived. The formulation may further comprise a reducing 
agent, such as NADH or NADPH. 
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The following examples are intended to further illustrate certain preferred embodiments 
of the invention and are not limiting in nature. 

EXAMPLE I 

Optimized Growth Conditions for Streptomyces tubercidicus Strain R-922 

In one non-limiting example the fermentation conditions needed to provide a steady 
supply of cells of Streptomyces tubercidicus strain R-922 highly capable of regioselectively 
oxidizing avermectin to 4"-keto-avermectin were optimized. 

First, the following solutions were made. For ISP-2 agar, 4 g of yeast extract 
(commercially available from Oxoid Ltd, Basingstoke, UK), 4 g of D(+)-glucose, 10 g of 
bacto malt extract (Difco No. 0186-17-7 (Difco products commercially available from, e.g., 
Voigt Global Distribution, Kansas City, MO)), and 20 g of agar (Difco No. 0140-01) were 
dissolved in one liter of demineralized water, and the pH is adjusted to 7.0. The solution was 
sterilized at 121 °C for 20 min., cooled down, and kept at 55°C for the time needed for the 
immediate preparation of the agar plates. 

For PHG medium, 10 g of peptone (Sigma 0521; commercially available from Sigma 
Chemical Co., St. Louis, MO), 10 g of yeast extract (commercially available from Difco), 10 g 
of D-(+)-glucose, 2 g of NaCl, 0.15 g of MgS0 4 x 7 H 2 0, 1.3 g of NaH 2 P0 4 x H 2 0, and 4.4 g 
of K2HPO4 were dissolved in 1 liter of demineralized water, and the pH was adjusted to 7.0. 

Streptomyces tubercidicus strain R-922 was grown in a Petri dish on ISP-2 agar at 28°C. 
This culture was used to inoculate four 500 ml shaker flasks with a baffle, each containing 
100 ml PHG medium. These pre-cultures were grown on an orbital shaker at 120 rpm at 28°C 
for 72 hours and then used to inoculate a 10-liter fermenter equipped with a mechanical stirrer 
and containing 8 liters of PHG medium. This main culture was grown at 28°C with stirring at 
500 rpm and with aeration of 1.75 vvm (14 1/min.) and a pressure of 0.7 bar. At the end of the 
exponential growth, after about 20 hours, the cells were harvested by centrifugation. The yield 
of wet cells was 70-80 g/1 culture. 

EXAMPLE n 
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Whole Cell Biocatalvsis Assay 

As determined in accordance with the present invention, the following whole cell 
biocatalysis assay was employed to determine that the activity from Streptomyces cells 
capable of regioselectively oxidizing avermectin to 4"-keto-avermectin is catalyzed by a P450 
monooxygenase. 

Streptomyces tubercidicus strain R-922 was grown in PHG medium, and Streptomyces 
tubercidicus strain 1-1529 was grown in M-17 or PHG medium. PHG medium contains 10 g/1 
Peptone (Sigma, 0.521), 10 g/1 Yeast Extract (Difco, 0127-17-9), 10 g/1 D-Glucose, 2 g/1 
NaCl, 0.15 g/1 MgS0 4 x 7 H 2 0, 1.3 g/1 NaH 2 P0 4 x 1 H 2 Q, and 4.4 g/1 K 2 HPQ 4 at pH 7.0. M- 
17 medium contains 10 g/1 glycerol, 20 g/1 Dextrin white, 10 g/1 Soytone (Difco 0437-17), 3 
g/1 Yeast Extract (Difco 0127-17-9), 2 g/1 (NHO2SO4, and 2 g/1 CaC0 3 at pH 7.0 

To grow the cells, an ISP2 agar plate (not older than 1-2 weeks) was inoculated and 
incubated for 3-7 days until good growth was achieved. Next, an overgrown agar piece was 
transferred (with an inoculation loop) to a 250ml Erlenmeyer flask with 1 baffle containing 50 
ml PHG medium. This pre-culture is incubated at 28°C and 120 rpm for 2-3 days. Next, 5 ml 
of the pre-culture were transferred to a 500 ml Erlenmeyer flask with 1 baffle containing 100 
ml PHG medium. The main culture was incubated at 28°C and 120 rpm for 2 days. Next, the 
culture was centrifuged for 10 min. at 8000 rpm on a Beckman Rotor JA-14. The cells were 
next washed once with 50 mM potassium phosphate buffer, pH 7.0. 

To perform the whole cell biocatalysis assay, 500 mg wet cells were placed into a 25 
ml Erlenmeyer flask, to which were added 10 ml of 50 mM potassium phosphate buffer, pH 
7.0. The cells were stirred with a magnetic stir bar to distribute the cells. Next, 15 \i\ of a 
solution of avermectin Bla in isopropanol (30 mg/ml) were added, and the mixture shaken on 
an orbital shaker at 160 rpm and 28°C. Strain R-922 was reacted for 2 hours, and strain I- 
1529 was reacted for 30 hours. 

To work up the cultures in the whole cell biocatalysis assay, 10 ml methyl-t-butyl-ether 
was added to an Erlenmeyer flask containing the resting cells and the entire cell mixture was 
transferred to a 30 ml-centrifuge tube, shaken vigorously, and then centrifuged at 16000 rpm 
for 10 min. The ether phase was pipetted into a 50 ml pear flask, and evaporated in vacuo by 
means of a rotary evaporator (<0.1 mbar). The residue was re-dissolved in 1.2 ml acetonitrile 
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and transferred to an HPLC-sample viaL The conversion of avermectin Bla to 4"-hydroxy- 
avermectin Bla and 4"-keto-avermectin Bla (also called 4"-oxo-avermectin Bla) and the 
formation of a side product from the biocatalysis reaction could be observed by HPLC 
analysis using HPLC protocol I. 

For HPLC protocol I, the following parameters were used: 



Hardware 
Pump: 

Autosampler: 
Interface Module: 
Channel 1 -Detector: 
Column Oven: 
Column: 
Adsorbent: 



L-6250 Merck-Hitachi 

AS-2000A Merck-Hitachi 

D-6000 Merck-Hitachi 

L-7450A UV-Diode Array Merck-Hitachi 

none 

70mm x 4mm 

Kromasil 100A-3.5|a-C18 



Gradient Mode: 


Low 






Pressure Limit: 


5-300bar 






Column Temperature 


ambient (~20°C) 




Solvent A: 


acetonitrile 






Solvent B: 


water 






Flow: 


1.5 ml/min 






Detection: 


243 nm 






Pump Table: 


0.0 min 


75% A 


25% B 


linear gradient 


7.0 min 


100% A 


0%B 




9.0 min 


100% A 


0%B 


jump 


9.1 min 


75% A 


25% B 




12.0 min 


75% A 


25% B 


Stop time: 


12 min 






Sampling Period: 


every 200 msec 




Retention time table: 


time 


References 





2.12 min 4"-hydroxy- avermectin Bla 
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3 .27 min avermectin B 1 a 

3 .77 min 3"-0-demethyl-4"-keto-avermectin B 1 a 
4.83 min 4"-keto-avermectin B 1 a 



EXAMPLE m 

Biotransformation With Cell-Free Extract From Streptomyces Strain R-922 



To prepare an active cell-free extract from Streptomyces tubercidicus strain R-922 
capable of regioselective oxidation of avermectin to 4" -keto- avermectin, the following 
solutions were made, stored at 4°C, and kept on ice when used. 



Solution 


Formula 


PP-buffer 


50 mM K 2 HP04/KH 2 P04 (pH 7.0) 


Disruption buffer 


50 mM K2HPO4/KH2PO4 (pH 7.0), 5 mM benzamidine, 2 mM 
dithiothreitol, and 0.5 mM Pefabloc (from Roche Diagnostics) 


Substrate 


10 mg avermectin were dissolved in 1 ml isopropanol 



Six grams of wet cells from Streptomyces strain R-922 were washed in PP-buffer and 
then resuspended in 35 ml disruption buffer and disrupted in a French press at 4°C. The 
resulting suspension was centrifuged for 1 hour at 35000 x g. The supernatant of the cell free 
extract was collected. One jli] substrate was added to 499jLil of cleared cell free extract and 
incubated at 30°C for 1 hour. Then, 1 ml methyl-t-butyl ether was added to the reaction 
mixture and thoroughly mixed. The mixture was next centrifuged for 2 min. at 14000 rpm, 
and the methyl-t-butyl ether phase was transferred into a 10 ml flask and evaporated in vacuo 
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by means of a rotary evaporator. The residue was dissolved in 200 \xl acetonitrile and 
transferred into an HFLC-sample vial. 

For HPLC, the HPLC protocol I was used. 

When 1 \x\ substrate was added to 499 |lx1 of cleared cell free extract and incubated at 
30°C, no conversion of avermectin to 4"-keto-avermectin was observed by HPLC analysis 
using HPLC protocol I. 

However, the possibility of addition of spinach ferredoxin and spinach ferredoxin 
reductase and NADPH to the cell free extract to restore the biocatalytic activity was explored 
(see, generally, D.E. Cane and E.I. Graziani, /. Amen Chem. Soc. 120:2682, 1998). 
Accordingly, the following solutions were made: 



Solution 


Formula 


Substrate 


10 mg avermectin were dissolved in 1 ml isopropanol 


Ferredoxin 


5 mg ferredoxin (from spinach), solution 1-3 mg/ml in Tris/HCl-buffer 
(from Fluka) 

or 5 mg ferredoxin (from Clostridium pasteurianum), solution of 1-3 
mg/ml in Tris/HCl-buffer (from Fluka) 

or 5 mg ferredoxin (from Porphyra umbilicalis), solution of 1-3 
mg/ml in Tris/HCl-buffer (from Fluka) 


Ferredoxin Reductase 


1 mg freeze-dried ferredoxin reductase (from spinach), solution of 3.9 
U/mg in 1 ml H 2 0 (from Sigma) 


NADPH 


100 mM NADPH in H 2 0 (from Roche Diagnostics) 



The substrate solution was stored at 4°C, the other solutions were stored at -20°C, and kept on 
ice when used. 



Thus, to 475 |Lil of cleared cell free extract the following solutions were added: 10 \i\ 
ferredoxin, 10 \x\ ferredoxin reductase and 1 \x\ substrate. After the addition of substrate to the 
cells, the mixture was immediately and thoroughly mixed and aerated. Then, 5 |Lil of NADPH 
were added and the mixture incubated at 30°C for 30 min. Then, 1 ml methyl -t-butyl ether 
was added to the reaction mixture and thoroughly mixed. The mixture was next centrifuged 
for 2 min. at 14000 rpm, and the methyl-t-butyl ether phase was transferred into a 10 ml flask 
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and evaporated in vacuo by means of a rotary evaporator. The residue was dissolved in 200 |il 
acetonitrile and transferred into an HPLC-sample vial, and HPLC analysis performed using 
HPLC protocol I. 

Formation of 4"-keto-avermectin was observable by HPLC analysis. Thus, addition of 
spinach ferredoxin and spinach ferredoxin reductase and NADPH to the cell free extract 
restored the biocatalytic activity. 

Upon injection of a 30 jxl sample, a peak appeared at 4.83 ruin., indicating the presence 
of 4"-keto-avermectin Bla. A mass of 870 D could be assigned to this peak by HPLC-mass 
spectrometry which corresponds to the molecular weight of 4"-keto-avermectin Bla. 

Note that when analyzing product formation by HPLC and HPLC -mass spectrometry, in 
addition to the 4"-keto-avermectin, the corresponding ketohydrate 4' T -hydroxy-avermectin was 
also found giving a peak at 2.12 min. This finding indicated that the P450 monooxygenase 
converts avermectin by hydroxylation to 4"-hydroxy-avermectin, from which 4"-keto~ 
avermectin is formed by dehydration. Interestingly, when the spinach ferredoxin was replaced 
by ferredoxin from the bacterium Clostridium pasteurianum or from the red alga Porphyra 
umbilicalis, the biocatalytic conversion of avermectin to 4"-keto-avermectin still took place, 
indicating that the enzyme does not depend on a specific ferredoxin for receiving reduction 
equivalents. 

EXAMPLE IV 

Isolation of a Mutant Streptomyces Strain R-922 With Enhanced Activity 

To obtain strains of Streptomyces strain R-922 that have an enhanced ability to 
regioselectively oxidize avermectin to 4"-keto-avermectin, UV mutants were generated. To 
do this, spores of Streptomyces strain R-922 were collected and stored in 15% glycerol at - 
20°C. This stock solution contained 2xl0 9 spores. 

The spore stock solution was next diluted and transferred to petri plates containing 10ml 
of sterile water, and the suspension was exposed to UV light in a Stratalinker UV 
crosslinker 2400 (commercially available from Stratagene, La Jolla, CA). The Stratalinker 
UV crosslinker uses a 254-nm light source and the amount of energy used to irradiate a 
sample can be set in the "energy mode." 
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Applicant's or agent's 




International applicationNo. 


file reference 


PB/5-6001 6A 


PCT/EP 02/05363 



INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
OR OTHER BIOLOGICAL MATERIAL 

(PCT Rule 13bis) 



A. The indications made below relate to the deposited microorganism or other biological material referred to in the description 
on page 41 t l me 1-7 . 



B, IDENTIFICATION OF DEPOSIT Further deposits are identified on an additional sheet 



Name of depositary institution 

Agricultural Research Service, Patent Culture Collection (NRRL) 



Address of depositary institution (including postal code and country) 

1815 North University Street 
Peoria 

Illinois 61604 
USA 



Date of deposit 



May 08, 2001 



Accession Number 



NRRL B-30479 



C. ADDITIONAL INDICATIONS (leave blank if not applicable) This information is continued on an additional sheet [ ] 



Pseudcmonas putida NRRL B-4067 containing plasmid pRK290-ema1/fd233 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indications are not for all designated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later (specify the general nature of the indications e.g., "Accession 
Number of Deposit") 



For receiving Office use only 



j | This sheet was received with the international application 



Authorized officer 



| XI This sheet was received by the International Bureau on: 

0 2 SEP 2002 


Authorized officer 





Form PCT/RO/134 (Julyl998) 
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file reference 
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INDICATIONS RELATING TO DEPOSITED MICROORGANISM 
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Through experimentation, it was determined that an exposure of 8000 microjoules of 
UV irradiation (254 nm) was required to kill 99.9% of the spores. This level of UV exposure 
was used in the mutagenesis. 

Surviving UV-mutagenized spores were plated, cultured, and transferred to minimal 
media. Approximately 0.3-0.4% of the viable spores were determined to be auxotrophic, 
indicating a good level of mutagenesis in the population. 

The mutagenized clones were screened for activity in the whole cell biocatalysis assay 
described in Example II. As shown in an HPLC chromatogram, one mutant ("R-922 UV 
mutant") showed a two to three fold increase in an ability to regioselectively oxidize 
avermectin to 4"-keto-avermectin as compared to wild-type strain R-922. Although the gene 
encoding the P450 monooxygenase responsible for the regioselectively oxidation activity, 
emal, is not mutated in the R-922 UV mutant, this mutant nonetheless provides an excellent 
source for a cell-free extract containing emal protein. 

EXAMPLE V 

Isolation of the P450 Monooxygenase from Streytomvces Strain R-922 

To enrich the P450 enzyme, 35 ml of active cell free extract were filtered through a 45 
{xm filter and fractionated by anion exchange chromatography. Anion exchange 
chromatography conditions were as follows: 
FPLC instrument: Akta prime (from Pharmacia Biotech) 

FPLC -column: HiTrap™Q (5 ml) stacked onto Resource® Q (6 ml) (from Pharmacia 
Biotech) 

eluents buffer A: 25 mM Tris/HCl (pH 7.5) 

buffer B: 25 mM Tris/HCl (pH 7.5) containing 1 M KC1 

temperature eluent bottles and fractions in ice bath, 

flow 3 ml/min 

detection UV 280nm 

Pump table: 0.0 min 100% A 0% B 

linear gradient to2.0 min 90% A 1 0% B 

5.0 min 90% A 10% B 
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linear gradient to30.0 man 50% A 50% B 
linear gradient toAO.Q min 0% A 100% B 
SO.Omin 0% A 100% B 



Enzyme activity eluted with 35%-40% buffer B. The active fractions were pooled and 
concentrated by centrifugal filtration through Biomax™ filters with an exclusion limit of 5kD 
(commercially available from Millipore Corp., Bedford, MA) at 5000 rpm and then rediluted 
in disruption buffer containing 20% glycerol to a volume of 5 ml containing 3-10 mg/ml 
protein. This enriched enzyme solution contained at least 25% of the original enzyme activity. 

The enzyme was further purified by size exclusion chromatography. Size exclusion 
chromatography conditions were as follows: 
FPLC instrument: Akta prime (from Pharmacia Biotech) 

HiLoad 26/60 Superdex® 200 prep grade (from Pharmacia Biotech) 
3-5 ml enriched enzyme solution from the anion chromatography step 
filtered through 45 jLim filter 
PP-buffer (pH 7.0) + 0.1 M KC1 
4°C 

2 ml/min 
UV 280nm 



FPLC-column: 
sample: 

sample preparation: 

eluent buffer: 

temperature: 

flow: 

detection: 



Enzyme activity eluted between 205-235 ml eluent buffer. The active fractions were 
pooled, concentrated by centrifugal filtration through Biomax™ filters with an exclusion limit 
of 5 kD (from Millipore) at 5000 rpm, and rediluted in disruption buffer containing 20% 
glycerol to form a solution of 0.5-1 ml containing 2-5mg/ml protein. This enriched enzyme 
solution contained 10% of the original enzyme activity. This enzyme preparation, when 
checked for purity by SDS page, {see, generally, Laemmli, U.K., Nature 227:680-685, 1970 
and Current Protocols in Molecular Biology , supra) and stained with Coomassie blue, showed 
one dominant protein band with a molecular weight of 45-50 kD, according to reference 
proteins of known molecular weight. 



EXAMPLE VI 
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Attempted Isolation of P450 Monooxygenase Genes 
From Strevtomyces Strains R-922 and 1-1529 

Based on results described above that suggested the enzyme from strain R-922 that is 
responsible for the regiospecific oxidation of avermectin to 4"-keto-avermectin is a P450 
monooxygenase, a direct PCR-based approach to clone P450 monooxygenase genes from this 
strain was initiated {see, generally, Hyun et ah, J. Microbiol BiotechnoL 8(3):295-299, 1998). 
This approach is based on the fact that all P450 monooxygenase enzymes contain highly 
conserved oxygen-binding and heme-binding domains that are also conserved at the 
nucleotide level. PGR primers were designed to prime to these conserved domains and to 
amplify the DNA fragment from P450 genes using R-922 or 1-1529 genomic DNA as a 
template. The PCR primers used are shown in Table 1. 



Table 1 



0 2 -Binding 


Domain 


Primers 


(5' to 3')* 


Degeneracy 


SEQ ID NOs 


I 
ATC 


A 
GCS 


G H 
GGS CAC 


E T 
GAG ACS 


T 
AC 




8 


43 
44 


V 
GTS 


A 
GCS 


G H 
GGS CAC 


E T 
GAG ACS 


T 
AC 




16 


45 
46 


Lt 
CTS 


A 
GCS 


G H 
GGS CAC 


E T 
GAG ACS 


T 
AC 




16 


47 
48 


L 

TS 


L 
CTS 


L I 
CTS ATC 


A G 
GCS GGS 


H E T 
CAC GAG AC* 


32 


49 
50 








Heme-Binding Domain Primers (3' to 5')* 






H 
GTG 


Q 
GTC 


C 
ACG 


L 

GAS 


G 
CCS 


Q 
TGC 


N 
TTG 


L A 
GAS CG & 


8 


51 

52 


F 
AAG 


G 
CCS 


H 
GTG 


G 
CCS 


V 
CAS 


H 
GTG 


Q 
GTC 


C 
ACG 


8 


53 
54 


F 
AAG 


G 
GCS 


F 
AAG 


G 
CCS 


V 
CAS 


H 
GTG 


Q 
GTC 


C 
ACG 


8 


55 
56 


F 
AAG 


G 
CCS 


H 
GTG 


G 
CCS 


I 
TAG 


H 
GTG 


Q 
GTC 


C 
ACG 


4 


57 
58 
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F 


G 


H 


G 


V 


H 


F 


C 




59 


AAG 


CCS 


GTG 


CCS 


CAS 


GTG 


AAG 


ACG 


8 


60 



* The amino acid sequence is shown on the top line and the corresponding nucleotide 
sequence is shown below on the second line; S=G or C. 
& This primer was described by Hyun et al, supra 



PCR amplification using any of the primers specific to nucleotide sequences encoding 
the 0 2 ~binding domain with any of the primers specific to the nucleotide sequences encoding 
the heme-binding domain and genomic DNA from Streptomyces strains R-922 or 1-1529 
resulted in the amplification of an approximately 350 bp DNA fragment. This is exactly the 
size that would be expected from this PCR amplification due to the approximately 350 bp 
separation in P450 genes of the gene segments encoding the 0 2 -binding and heme-binding 
sites. 

The 350 bp PCR fragments were cloned into the pCR2.1-TOPO TA cloning plasmid 
(commercially available Invitrogen, Carlsbad, CA) and transformed into E, colt strain TOP10 
(Invitrogen, Carlsbad, CA). Approximately 150 individual clones from strains R-922 and I- 
1529 were sequenced to determine how many unique P450 gene fragments were represented. 
Analysis of the sequences revealed that they included 8 unique P450 gene fragments from 
strain R-922 and 7 unique fragments from 1-1529. 

Blast analysis (alignment of the deduced amino acid sequences of P450 gene-specific 
PCR fragments derived from Streptomyces tubercidicus strain R-922 and Streptomyces strain 
1-1529, respectively, and the P450 monooxygenase from S. thermotolerans that is involved in 
the synthesis of carbomycin (Stol-ORFA) (GenBank Accession No. D30759) by the program 
Pretty from the University of Wisconsin Package version 10.1 (Altschul et aL, Nucl. Acids 
Res. 25:3389-3402). demonstrated that all of the unique P450 gene fragments from both the 
R-922 and 1-1529 strains were derived from P450 genes and encoded the region between the 
Cb-binding and heme-binding domains. 

Next, in order to clone the full-length genes from which the PCR fragments were 
derived, the DNA fragments cloned by PCR were used as hybridization probes to gene 
libraries containing genomic DNA from strains R-922 and 1-1529. To do this, genomic DNA 
from the R-922 and 1-1529 strains was partially digested with Sau3A I, dephosphorylated with 
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calf intestinal alkaline phosphatase (CIP) and ligated into the cosmid pPEH215, a modified 
version of SuperCos 1 (commercially available from Stratagene, La Jolla, CA). Ligation 
products were packaged using the Gigapack III XL packaging extract and transfected into E. 
coli XL1 Blue MR host cells. Twelve cosmids that strongly hybridized to the PCR-generated 
P450 gene fragments were identified from the R-922 library, from which three unique P-450 
genes were subcloned and sequenced. The hybridizations were performed at high stringency 
conditions according to the protocol of Church and Gilbert (Church and Gilbert, Proc. Natl 
Acad. Set USA 81:1991-1995, 1984). In brief, these high stringency conditions include 
Hybrid Buffer containing 500 mM Na-phosphate, 1 mM EDTA, 7% SDS, 1% BSA; Wash 
Buffer 1 containing 40 mM Na-phosphate, 1 mM EDTA, 5% SDS, 0.5% BSA; and Wash 
Buffer 2 containing 40 mM Na-phosphate, 1 mM EDTA, 1% SDS (Note that other high 
stringency hybridizations conditions are described, for example, in Current Protocols in 
Molecular Biology , supra.) Nineteen strongly hybridizing cosmids were identified from the I- 
1529 library, and from these, four unique P-450 genes were subcloned and sequenced. 

In yet a further approach to isolate diverse P450 monooxygenase genes from strains R- 
922 and 1-1529, a known P450 gene from another bacterium was used as a hybridization 
probe to identify cosmid clones containing homologous P450 genes from strains R-922 and I- 
1529. The epoF P450 gene from Sorangium cellulosum strain So ce90 that is involved in the 
synthesis of epothilones (Molnar et al, Chem Biol. 7(2):97-109, 2000) was used as a probe in 
this effort. Using the epoF P450 gene probe, one cosmid was identified from strain R-922 
(clone LC), and threewere identified from strain 1-1529 (clones LA, LB, and EA). In each 
case, the homologous gene fragment was subcloned and sequenced, and found to code for 
P450 monooxygenase enzymes. 

However, a comparison of the 17 polypeptide sequences identified in Example VII 
(below) failed to match any of these cloned genes. Two of the polypeptide sequences 
(namely, LVKDDPALLPR and AVHELMR) mapped to the region between the 0 2 and heme 
binding domains, and so these should have identified any of the partial gene fragments derived 
by the PGR approach. Thus, the standard approaches based on the known PGR technique of 
Hyun et al., supra, and using known P450 genes as hybridization probes failed to identify the 
gene that encodes the specific P450 monooxygenase responsible for the regioselective 
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oxidation of avermectin. Accordingly, it was determined that additional experimentation was 
required to isolate the gene encoding the P450 monooxygenase of the invention. 

EXAMPLE Vn 

Partial Sequencing of the P450 Monooxygenase from Streptomyces Strain R-922 

Partial amino acid sequencing of the P450 monooxygenase from Streptomyces strain R- 
922 was carried out by the Friedrich Miescher Institute, Basel Switzerland. The protein of the 
dominant band on the SDS page was tryptically digested and the formed peptides separated 
and sequenced by mass spectrometry and Edman degradation {see, generally, Zerbe-Burkhardt 
et al, 7. Biol Chem. 273:6508, 1998). The sequence of the following 17 peptides were found: 

Sequence Sequence LP. No. 



HPGEPNVMDPALITDPFTGYGALR 


(SEQ ID NO:61) 


FVNNPASPSLNYAPEDNPLTR 


(SEQ ID NO:62) 


LLTHYPDISLGIAPEHLER 


(SEQ ID NO:63) 


VYLLGSILNYDAPDHTR 


(SEQ ID NO:64) 


TWGADLISMDPDR 


(SEQ ID NO:65) 


E ALTDDLLS ELBR. 


(SEQ ID NO:66) 


FMDDSPVWLVTR 


(SEQ ID NO:67) 


LMEMLGLPEHLR 


(SEQ ID NO:68) 


VEQIADALLAR 


(SEQ ID NO:69) 


LVKDDPALLPR 


(SEQ ID NO:70) 


DDPALLPR 


(SEQ ID NO:71) 


TPLPGNWR 


(SEQ ID NO:72) 


LNSLPVR 


(SEQ ID NO:73) 


ITDLRPR 


(SEQ ID NO:74) 


EQGPVVR 


(SEQ ID NO:75) 


AVBELMR 


(SEQ ID NO:76) 
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AFTAR (SEQ ID NO:77) 

FEEVR (SEQ ID NO:78) 

Alignment of these peptides to a selection of actinomycete P450 monooxygenase 
sequences indicated that all the peptides were fragments of a single P450 mono-oxygenase. 

EXAMPLE Vffl 

Cloning the P450 Monooxygenase Gene from Strain R-922 that Encodes the Enzyme 
Responsible for the Oxidation of Avermectin to 4"-Keto-Avermectin 

PCR primers were designed by reverse translation from the amino acid sequences of 
several of the peptides derived from the P450 enzyme of strain R-922 {see Example VII and 
Table 2 below). Each of five forward primers (2aF, 2bF, 3F, IF, and 7F) was paired with one 
reverse primer (5R) in PCR reactions with R-922 genomic DNA as a template. In each 
reaction, a DNA fragment of the expected size was produced. 



Table 2 



Primer 


Primer 


sequence and the amino acid 


Degen- 


Expected 


SEQ 




sequence to which 


they 


were designed* 


eracy 


size 


ID 




















(bp) ** 


NO: 


2aF 


P 


G 


E 


D 


N 


V 


M 


64 


600 


79 




5' -CCS 


GGS 


GAR 


CCS 


AAY 


GTS 


ATG- 3 ' 






80 


2bF 


A 


L 


I 


T 


D 


P 


F 


32 


580 


81 




5 ' -GCS 


CTS 


ATY 


ACS 


GAC 


CCS 


TTC- 3 ' 






82 


3F 


F 


M 


D 


D 


S 


p 


V W 


32 


549 


83 




5 ' -TTC 


ATG 


GAC 


GAC 


WSS 


CCS 


GTS TGG-3 ' 






84 


IF 


L 


N 


Y 


D 


A 


p 


D H 


32 


350 


85 




5 ' -CTS 


AAY 


TAY 


GAC 


GCS 


CCS 


GAC CAC-3' 






86 


7F 


V 


E 


Q 


I 


A 


D 


A L 


32 


300 


87 




5 ' -GTS 


GAR 


CAG 


ATY 


GCS 


GAC 


GCS CTS-3' 






88 


5R 


D 


L 


I 


S 


M 


D 


P D 


64 




89 




3 ' -CTG 


GAS 


TAR 


WSS 


TAC 


CTG 


GGS CTG- 5' 






90 



Ambiguity codes: Y=C or T; R=A or G; S=C or G; W=A or T 
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** Expected size of PCR product when the primer is when paired with 
primer 5R 

The 580 and 600 bp PCR fragments generated by using primers (2bF and 5R) and (2aF 
and 5R), respectively, were cloned into the pCR-Blunt II-TOPO cloning plasmid 
(commercially available from Invitrogen, Carlsbad, CA) and transformed into E, coli strain 
TOP10 (Invitrogen, Carlsbad, CA). The inserted DNA fragments were then sequenced. 
Examination of the sequences revealed that the 600 and 580 bp fragments were identical in 
the 580 bp of sequence that they have in common. Also, there was a perfect match between 
the deduced amino acid sequence (SEQ ID NO:2) derived from the nucleotide sequence of the 
600 bp and 580 bp fragments and the amino acid sequences of peptides isolated from the 
purified P450 Ema i enzyme that aligned in this region of the isolated gene. This result strongly 
suggested that the gene fragments isolated in these clones are derived from the gene that 
encodes the P450 Em ai enzyme that is responsible for the oxidation of avermectin to 4"-keto- 
avermectin. 

The 600 bp PCR fragment produced using primers 2aF (SEQ ID No: 80) and 5R (SEQ 
ID No:90) was used as a hybridization probe to a cosmid library of genomic DNA isolated 
from strain R-922 (cosmid library described in Example VI). Two cosmids named pPEH249 
and pPEH250 were identified that hybridized strongly with the probe. The portion of each 
cosmid encoding the P450 enzyme was sequenced and the sequences were found to be 
identical between the two cosmids. The complete coding sequence of the emal gene was 
identified (SEQ ID NO:l). The amino acid sequence of all polypeptide fragments from 
P450 Em ai matched perfectly with the deduced amino acid sequence from the emal gene. 
Comparison of the deduced amino acid sequence of the protein encoded by the emal gene 
using BLASTP (Altschul et ah, supra) determined that the closest match in the databases is to 
a P450 monooxygenase from S. thermotolerans that has a role in the biosynthesis of 
carbomycin (Arisawa et ah, Biosci. Biotech. Biochem. 59(4):582-588, 1995) and whose 
identity with emal is only 49% (Identities = 202/409 (49%), Positives = 271/409 (65%), Gaps 
= 2/409 (0%)). In the Blast analysis, the following settings were employed: 
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BLASTP 2.0.10 
Lambda K H 

0.322 0.140 0.428 

Gapped 

Lambda K H 

0.270 0.0470 0.230 
Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 375001765 

Number of Sequences: 1271323 

Number of extensions: 16451653 

Number of successful extensions: 46738 

Number of sequences better than 10.0: 2211 

Number of HSP's better than 10.0 without gapping: 628 

Number of HSP's successfully gapped in prelim test: 1583 

Number of HSP's that attempted gapping in prelim test: 43251 

Number of HSP's gapped (non-prelim): 2577 

length of query: 43 0 

length of database: 409,691,007 

effective HSP length: 55 

effective length of query: 375 

effective length of database: 339,768,242 

effective search space: 127413090750 

effective search space used: 127413090750 



A similar comparison of the nucleotide sequences of these two genes demonstrated that 
they are 65% identical at the nucleotide level. These results demonstrate that P450 Em ai is a 
new enzyme. 

EXAMPLE IX 

Heterologous Expression of the emal Gene in Streptomvces lividans Strain ZX7 

The coding sequence of the emal gene was fused to the thiostrepton-inducible promoter 
(tipA) (Murakami et al, J. Bacteriol. 171:1459-1466, 1989). The tipA promoter was derived 
from plasmidpSIT151 (Herron and Evans, FEMS Microbiology Letters 171:215-221, 1999). 

The fusion of the tipA promoter and the emal coding sequence was achieved by first 
amplifying the emal coding sequence with the following primers to introduce a Pad cloning 
site at the 5' end and a Pmel compatible end on the 3' end. 

Forward Primer: The underlined sequence is a Pad recognition sequence; the 
sequence in bold-face type is the start of the coding sequence of emal. 

5 ' -AG A TTAATTAAT GTCGGAATTA ATG A ACTGTCdGTT-3 ' (SEQ ID NO:91) 
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Reverse Primer: The underlined sequence is half of a Pmel recognition 
sequence; the bold-face type sequence is the reverse complement of the emal 
translation stop codon followed by the 3' end of the emal coding sequence. 

5 ' - A A AC TC ACCCC A ACCGC ACCGGC AGCG AGTTC-3" (SEQ ED NO:92) 

The Pad-digested PCR fragment containing the emal coding sequence was cloned into 
plasmid pTBBKA (see Figure 1) that was restricted (i.e., digested) with Pad and Pmel, and 
the ligated plasmid transformed into E. coli. Four clones were sequenced. Three of the four 
contained the complete and correct emal coding sequence. The fourth emal gene clone 
contained a truncated version of the emal gene. The full-length emal gene encodes a protein 
that begins with the amino acid sequence MSELMNS (SEQ ID NO:93). The truncated gene 
encodes a protein that lacks the first 4 amino acids and begins with the second methionine 
residue. This gene has been named emal A. The nucleotide and amino acid sequence of 
emalA are provided as SEQ ID NO:33 and SEQ ID NO:34, respectively. The emal and 
emal A genes in these plasmids, pTBBKA-<?7?zai and pTBBKA-emaiA, are in the correct 
juxtaposition with the tipA promoter to cause expression of the genes from this promoter. 

Plasmid pTBBKA contains a gene from the Streptomyces insertion element IS 117 that 
encodes an integrase that catalyzes site-specific integration of the plasmid into the 
chromosome of Streptomyces species (Henderson et ah, MoL Microbiol 3:1307-1318, 1989 
and Lydiate et al, MoL Gen. Genet. 203:79-88, 1986). Since plasmid pTBBKA has only an 
E. coli replication origin and contains a mobilization site, it can be transferred from E. coli to 
Streptomyces strains by conjugation where it will not replicate. However, it is able to 
integrate into the chromosome due to the IS1 17 integrase and Streptomyces clones containing 
chromosomal integrations can be selected by resistance to kanamycin due to the plasmid- 
borne kanamycin resistance gene. 

The emal coding sequence was also cloned into other plasmids that are either 
replicative in Streptomyces or, like pTBBKA, integrate into the chromosome upon 
introduction into a Streptomyces host. For example, emal was cloned into plasmid pEAA, 
which is similar to plasmid pTBBKA but the KpnI/PacI fragment containing the tipA 
promoter was replaced with the ermE gene promoter (Schmitt-John and Engels, Appl 
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Microbiol Biotechnol 36(4):493-498, 1992). In addition, pEAA does not contain the 
kanamycin resistance gene. The emal gene was cloned into pEAA as a Pacl/Pmel fragment to 
create plasrnid pEAA-emal in which the emal gene is expressed from the constitutive ermE 
promoter. 

Plasrnid pTUAlA is a Streptomyces-E.coli shuttle plasrnid {see Figure 2) that contains 
the tipA promoter. The emal gene was also cloned into the Pacl/Pmel site in plasrnid 
pTUAlA to create plasrnid pTUA-eraai. 

The emal A gene fragment was also li gated as a Pacl/Pmel fragment into plasmids 
pTUAl A, and pEAA in the same way as the emal gene fragment to create plasmids pTUA- 
emal A, and pEAA-emalA, respectively. 

The pTBBKA, pTUAl A, and pEAA based plasmids containing the emal or emalA 
genes were introduced into 5. lividans ZX7 and in each case transformants were obtained and 
verified (S. lividans strains ZX7::pTBBKA-em<z7 or emal A, ZX7 {pTUA-emal or -emal A), 
and ZX7::pEAA-emal or -emal A, respectively). 

Wild-type Streptomyces lividans strain ZX7 was tested and found to be incapable of the 
oxidation of avermectin to 4"-keto-avermectin. Transformed S. lividans strains 
ZX7::pTBBKA-ema7, ZX7::pTBBKA-emaiA, ZX7 (pTUA-emai), ZX7 (pTUA-emaZA), 
ZX7 : :pBAA-emal , and ZX7 : ipEAA-emalA were each tested for the ability to oxidize 
avermectin to 4"-keto-avermectin using resting cells. To do this, the whole cell biocatalysis 
assay described above (including analysis method) was performed. Note that for the whole 
cell biocatalysis assay, transformed Streptomyces lividans, like strain R-922, was grown in 
PHG medium and, again like strain R-922, had a reaction time of 16 hours (Le. 9 during which 
time the 500 mg transformed Streptomyces lividans wet cells in 10 ml of 50 niM potassium 
phosphate buffer, pH 7.0, were shaken at 160 rpm at 28°C in the presence of 15 |til of a 
solution of avermectin in isopropanol (30 mg/ml)). 

In the presence of the inducer, thiostrepton (5 ug/ml), the emal- or eraaiA-containing 
strains ZX7::pTBBKA-emai, ZX7::pTBBKA-emaiA, ZX7 {pTVA-emal), ZX7 (pTUA- 
emal A) were found to oxidize avermectin to 4"-keto-avermectin as evidenced by the 
appearance of the oxidized 4"-keto-avermectin compound {see Table 3). 

Table 3 
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Jtseispiei i. otram 


% Conversion of Avermectin 




2 hour 


16 hour 


Sfr&nfninvf0Q Jivi/Jstrtv 4- T*l5*«miH^ 

»JM t/j/H/l/jytei) 1-1 F ItlLtltO JLJJ\*i T X Id Oil 111* 






None 


0 


0 


pTBBKA^m^7^i 


0.5 ±0.059 


1.17 ±0.112 


pTBBKA-emai 


0.21 ±0.0.356 


0.65 ±0.079 


pTUA-emai 


20.96 ±1.044 


42.0 ±2.5 


pEAA-^mai 


3.0 ±0.232 


24.1 ±0.358 


pTBBKA-ema2 


4.79 ±0.096 


9.57 ±0.423 


pTUA-ema2 


0.77 ±0.138 


2.05 ±0.537 


pEAA-6/??<22 


0.0 


1.73 ±3.00 


pTBBKA-emal/fd233 


8.89 ±0.720 


30.99 ±0.880 


rfTJJA-ema 1 Ifd23 ? 


23.29 ±0.854 


61.2 ±3.548 




8.26 ±0.845 


10.66 ±0.858 




1.85 ±0.861 


6.40 ±1.918 








Pseudotnonas putida S12 + Plasmid 






None 




0 


pRK-emai 


ND 2 


18 


pRK-<?mai//tf233 


ND 


32 



J pTBBKA= IS1 17 integrase, tipA promoter; pTUA= replicative plasmid, tipA promoter; 
pEAA= IS 117 integrase, ermE promoter 
2 Not Determined 



These results conclusively demonstrate that the P450 Ema i enzyme encoded by the emal gene 
is responsible for the oxidation of avermectin to 4"-keto-avermectin in S. tubercidicus strain 
R-922. Furthermore, the data demonstrates that the emal A gene that is 4 amino acids shorter 
on the N-terminus than the native emal gene also encodes an active P450 E mai enzyme. As 
can be demonstrated by HPLC analysis, oxidation of avermectin to 4"-keto-avermectin by 5. 
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lividans strain ZX7::pTBBKA-<?mai following induction of emal expression with 0, 0,5, or 
5.0 jxg/ml thiostrepton. is variable depending upon the amount of thiostrepton used to induce 
expression of emal. Note that S. lividans strains ZX7::pEAA-eraa7 and ZX7::pEAA-e?nalA 
(see Table 3) demonstrated this oxidation activity in the absence of thiostrepton since in these 
strains the emal or emal A genes are expressed from the ermE promoter that does not require 
induction. 

EXAMPLE X 

Isolation of an emal -Homologous Gene From Streptomyces tubercidicus Strain 1-1529 

Streptomyces tubercidicus strain 1-1529 was also found to be active in biocatalysis of 
avermectin to form the 4"-keto-avermectin derivative. The cosmid library from strain 1-1529, 
described in Example VI, was probed at the high stringency conditions of Church and Gilbert 
(Church and Gilbert, Proc. Natl Acad. Sci. USA 81:1991-1995, 1984) with the 600 bp emal 
PGR fragment produced using primers 2aF (SEQ ID No:80) and 5R (SEQ ID No:90) 
described previously to identify clones containing the emal homolog from strain 1-1529. 
Three strongly hybridizing cosmids were identified. The P450 gene regions in two of the 
cosmids, pPEH252 and pPEH253, were sequenced and found to be identical. Analysis of the 
DNA sequence revealed the presence of a gene with high homology to the emal gene of strain 
R-922. A comparison of the deduced amino acid sequence of Ema2 {i.e., P450 Ema 2), Emal 
(i.e., P450Emai) ? and a P450 monooxygenase from Streptomyces thermotolerans that is 
involved in the biosynthesis of carbomycin (Carb-450) (GenBank Accession No. D30759). 
demonstrated that all of the unique P450 gene fragments from both the R-922 and 1-1529 
strains were derived from P450 genes and encoded the region between the 02-binding and 
heme-binding domains. 

The gene from Streptomyces tubercidicus strain 1-1529, named emal, encodes an 
enzyme with 90% identity at the amino acid level and 90.6% identity at the nucleotide level to 
the P450Emai enzyme. The nucleotide sequence of the ema2 gene and the deduced amino acid 
sequence of P450 E ma2 are provided in SEQ ID NO:3 and SEQ ID NO:4, respectively. 

The ema2 coding sequence was cloned in the same manner as the emal and emal A 
genes into plasmids pTBBKA, pTUAlA, and pEAA such that the coding sequence was 
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functionally fused to the tipA or ermE* promoter in these plasrnids. The resulting plasmids, 
pTBBKA-£7na2, pTUA-^mcz2, and pEAA-ema2 were transferred from E. coli to S. lividans 
ZX7 by conjugation to create strains ZX7::TBBKA-ema2 and ZX7 (pTUA-em<z2), and 
ZX7::pEAA-ema2 containing the em#2 gene integrated into the chromosome or maintained 
on a plasmid. 

Strains ZX7::TBBKA-e;rca2, ZX7 (pTUA-ema2), and ZX7::pEAA-emo2 were next 
tested for the ability to oxidize avermectin to 4"-keto-avermectin. The ema2 gene was also 
shown to provide biocatalysis activity, although at a lower level compared to the emal gene 
{see Table 3). 

These results demonstrate that the emal gene from 5. tubercidicus strain 1-1529 also 
encodes a P450 enzyme (P450 Ema2 ) capable of oxidizing avermectin to 4"-keto-avermectin. 

EXAMPLE XI 

Characterization of emal Homologs From Other Biocatalysis Strains 

Seventeen Streptomyces sp. strains, including strains R-922 and 1-1529, were identified 
that are capable of catalyzing the regiospecific oxidation of the 4"-carbinol of avermectin to a 
ketone. Next, the isolation and characterization of the genes encoding the biocatalysis enzyme 
from all of these strains was accomplished. 

To do this, genomic DNA was isolated from the strains and was evaluated by restriction 
with several restriction endonucleases and Southern hybridization with the emal gene. A 
specific restriction endonuclease was identified for each DNA that would generate a single 
DNA fragment of a defined size to which the emal gene hybridizes. For each strain, there 
was only one strongly hybridizing DNA fragment, thus suggesting that other P450 genes were 
not detected under the high stringency hybridization conditions used in these experiments. 
Each DNA was digested with the appropriate restriction endonuclease, and the DNA was 
subjected to agarose gel electrophoresis. DNA in a narrow size range that included the size of 
the emai-hybridizing fragment was excised from the gel. The size selected DNA was ligated 
into an appropriate cloning plasmid and this ligated plasmid was used to transform E. coli. 
The E. coli clones from each experiment were screened by colony hybridization with the emal 
gene fragment to identify clones containing the emal -homologous DNA fragment. 
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The nucleotide sequence of the cloned DNA in each emal -homologous clone was 
determined and examined for the presence of a gene encoding a P450 enzyme with homology 
to emal. In this way, emal -homologous genes were isolated from 14 of the 15 other active 
strains. The nucleotide and deduced amino acid sequences of these are referenced in Table 4 
as SEQ ID NOS:5-32 and 94-95. The relationship of these enzymes can be shown in the form 
of a phylogenetic tree. Such a phylogenetic tree can be generated using the commercially 
available GCG Wisconsin software program version 1.0 (Madison, WI). 



Table 4 



Strain Number 




Classification 


SEQ ID NO (nucleotide 
and amino acid, 
respectively) 


R-0922 


emal 


Streptomyces tubercidicus 


2. 1 and 2 


1-1529 


emal 


Streptomyces tubercidicus 


3 and 4 


1053 


ema3 


Streptomyces rimosus 


5 and 6 


R-0401 


ema4 


Streptomyces lydicus 


7 and 8 


1-1525 


ema5 


Streptomyces sp. 


9 and 10 


DSM-40241 


ema6 


Streptomyces chattanoogensis* 


3. 11 and 12 


IHS-0435 


ema7 


Streptomyces sp. 


13 and 14 


C-00083 


ema8 


Streptomyces albofaciens 


15 and 16 


MAAG-7479 


ema9 


Streptomyces platensis 


17 and 18 


A/96-1208710 


emalO 


Streptomyces kasugaensis 


4. 19 and 20 


R-2374 


email 


Streptomyces rimosus 


21 and 22 


MAAG-7027 


emal 2 


Streptomyces tubercidicus 


5. 23 and 24 


Tue-3077 


emal 3 


Streptomyces platensis 


25 and 26 


1-1548 


emal4 


Streptomyces platensis 


27 and 28 


NRRL-2433 


emal 5 


Streptomyces lydicus 


6. 29 and 30 


MAAG-0114 


emal 6 


Streptomyces lydicus 


31 and 32 


DSM-40261 


emal 7 


Streptomyces tubercidicus 


94 and 95 


* This strain was s 


lown to be in the chattanoogensis species by 16s rDNA analysis; however, 



classical taxonomic methods used by the German culture collection (DSMZ) showed it to be 
saraceticus. 



EXAMPLE Xn 

Construction of His-tagged emal and emal Homologs to Facilitate Enzyme Purification 
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In order to purify the P450 Em ai enzyme and the P450 enzymes encoded by the emal 
homologs from other biocatalysis strains, each of the P450 genes was cloned into the E, coli 
expression plasmid pET-28b(+) (commercially available from Novagen, Madison, WI). The 
pET-28 plasmids are designed to facilitate His-tag fusions at either the N-, or C-terminus and 
to provide strong expression of the genes in E. coli from the T7 phage promoter. In many 
cases, the coding sequence of the ema genes begins with the sequence ATGT. These genes 
were amplified by PCR such that the primers on the 5' end incorporated a Pcil recognition site 
(5' AT ATGT 3') at the 5' terminus. The last four bases of the Pcil site correspond to the 
ATGT at the beginning of the ema gene coding sequence. 

PCR primers at the 3' end of the genes were designed to remove the translation stop 
codon at the end of the ema gene coding sequence and to add an Xhol recognition site to the 
3' terminus. The resulting PCR fragments were restricted with Pcil and Xhol to generate Pcil 
ends at the 5' termini and Xhol ends at the 3' termini, thereby facilitating cloning of the 
fragments into pET-28b(+) previously restricted with Ncol and Xhol. Since Pcil and Ncol 
ends are compatible, the fragments were cloned into pET-28b(+) in the proper orientation to 
the T7 promoter and ribosome binding site in the plasmid to provide expression of the genes. 

At the 3' end of each ema gene, the coding sequence was fused in frame at the Xhol site 
to the His-tag sequence followed by a translation stop codon. This results in the production of 
an Ema enzyme with six histidine residues added to the C-terminus to facilitate purification 
on nickel columns. 

In the case of ema genes in which the ATG translation initiation codon is not followed 
by a T nucleotide, the ema genes were amplified by PCR using a different strategy for the 5' 
end. The primers at the 5' end were designed to incorporate a C immediately preceding the 
ATG translation initiation codon and the primers at the 3' end were the same as described 
above. The PCR fragments that were amplified were restricted with Xhol to create an Xhol 
end at the 3'-terminus and the 5' end was left as a blunt end. These fragments were cloned 
into pET-28b(+) that had been restricted with Ncol, but the Ncol ends were made blunt-ended 
by treatment with mung bean exonuclease, and restricted with Xhol. 

In this manner, the ema genes were cloned into pET-28b(+) to create a functional fusion 
with the T7 promoter and the His-tag at the C-terminus as described previously. All His- 
tagged ema genes were sequenced to ensure that no errors were introduced by PCR. 
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Large amounts of the P450Emai and P450Ema2 enzymes were isolated and purified by 
standard protocols. E. coli strain BL21 DE3 (commercially available from Invitrogen; 
Carlsbad, CA) containing the T7 RNA polymerase gene under the control of the inducible tac 
promoter and the appropriate pET-28/^ma plasmid was cultured and the cells were harvested 
and lysed. The lysates were applied to Ni-NTA columns (commercially available from 
Qiagen Inc., Valencia, CA) and the protein were purified according to the procedure 
recommended by the manufacturer. 

Purified His-tagged P450 En iai and P450 Em a2 were highly active in in vitro activity 
assays as evidenced by a high rate of conversion of avermectin to 4"-keto-avemectin. 

EXAMPLE Xm 
Expression of emal in Pseudomonas 

The emal gene constructs were next introduced into P. putida (wildtype P. putida 
commercially available from the American Type Culture Collection, Manassas, Virginia; 
ATCC Nos. 700801 and 17453). The emal and emal/fd233 gene fragments were cloned as 
Pacl/Pmel fragments into the plasmid pUK21 (Viera and Messing, Gene 100:189-194, 1991). 
The fragments were cloned into a position located between the tac promoter (P tac ) and 
terminator (T tac ) on pUK21 in the proper orientation for expression from the tac promoter. 
The Ptac-emai-Ttac and P tac -emal/fd233~T tac gene fragments were removed from pUK21 as 
Bgin fragments and these were cloned into the broad host-range, transmissible plasmid, 
pRK290 (Ditta et al, Proc. Natl Acad. Set USA 77:7347-7351, 1980) to create plasmids 
pRK-emal and pRK-emal/fd233 (Figure 3). These plasmids were introduced into P. putida 
strains ATCC 700801 and ATCC 17453 by conjugal transfer from E. coli hosts by standard 
methodology (Ditta et al, Proc. Natl Acad. Set USA 77:7347-7351, 1980). 

P. putida ATCC 700801 and ATCC 17453 containing plasmids pRK-emal or pRK- 
emal/fd233 were tested for the ability to catalyze the oxidation of avermectin. The results 
shown in Table 3 demonstrate that these strains are able to catalyze this reaction. 

EXAMPLE XIV 
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Identification of Genes Encoding Ferredoxins That Are Active With the 

P450R m^ Monooxygenase 

P450 monooxygenases require two electrons for each hydroxylation reaction catalyzed 
(Mueller et aL, "Twenty-five years of P450 cam research: Mechanistic Insights into Oxygenase 
Catalysis." Cytochrome P450 , 2 nd Edition, P.R. Ortiz de Montellano (ed.), pp. 83-124; Plenum 
Press, NY 1995). These electrons are transferred to the P450 monooxygenase one at a time by 
a ferredoxin. The electrons are ultimately derived from NAD(P)H and are passed to the 
ferredoxin by a ferredoxin reductase. Specific P-450 monooxygenase enzymes have a higher 
activity when they interact with a specific ferredoxin. In many cases, the gene encoding a 
ferredoxin that interacts specifically with a given P450 monooxygenase is located adjacent to 
the gene encoding the P450 enzyme. 

As described above, in addition to the emal gene, four P450 genes from strain R-922 
and seven P450 genes from strain 1-1529 {see Example VI) were isolated and sequenced. In 
some of these, there was sufficient sequence information about the DNA flanking the P-450 
genes to look for the presence of associated ferredoxin genes. By this approach, two unique 
ferredoxin genes were identified from each of the two strains. Ferredoxin genes fd229 and 
fd230 were identified from strain R-922, and fd233 and fdEA were identified from strain T 
1529. In addition, a ferredoxin reductase gene was found to reside adjacent to thefdEA gene 
from strain 1-1529. 

In order to test the biological activity of each of these ferredoxins in combination with 
P450 Em ai, each individual ferredoxin gene was amplified by PCR to produce a gene fragment 
that included a blunt 5 '-end, the native ribosome-binding site and ferredoxin gene coding 
sequence, and a Pmel restriction site on the 3 '-end. Each such ferredoxin gene fragment was 
cloned into the Pmel site located 3' to the emal gene in plasmid pTXJA-emal . In this way, 
artificial operons consisting of the emal gene and one of the ferredoxin genes operably linked 
to a functional promoter were created. 

In the case of the/dEA ferredoxin gene in which a ferredoxin reductase gene,/reEA, 
was found to be located adjacent to the fdEA gene, a DNA fragment containing both the/<2EA 
and freEA genes was generated by a similar PCR strategy. This gene fragment was also 
cloned in the Pmel site of plasmid pTXJA-emal as described for the other ferredoxin genes. 
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Each emal -ferredoxin gene combination was tested for biological activity by 
introduction of the individual ema 7 -ferredoxin gene plasmids into S, lividans strain ZX7. The 
biocatalysis activity derived from each plasmid in S. lividans was determined. Of the four 
different constructs, only the ferredoxin gene fd233 derived from strain 1-1529 provided 
increased activity when compared to the expression of emal alone in the same plasmid and 
host background (see Table 3). The pTU A-emal/fd233 plasmid in S. lividans provided 
approximately 1.5 to 3- fold higher activity compared to the pTXJA-emal plasmid. The other 
three plasmids containing the other ferredoxin genes gave results essentially the same as the 
plasmid with only the emal gene. Likewise, the pTUA-emal/fdEAlfreBA plasmid did not 
yield results different from those of pTUA-emal. The nucleotide and deduced amino acid 
sequences of the fd233 gene are shown in SEQ ID NOs:35 and 36, respectively. 

A BLAST analysis of the nucelotide and amino acid sequences of fd233 revealed that 
the closest matches were to ferredoxins from S. coelicolor (GenBank Accession AL445945) 
and S. lividans (GenBank Accession AF072709). At the nucleotide level, fd233 shares 80 and 
79.8 % identity with the ferredoxin genes from S. coelicolor and S. lividans, respectively. At 
the peptide level, fd233 shares 79.4 and 77.8% identity with the ferredoxins from S. coelicolor 
and S. lividans, respectively. 

Since fd233 is derived from strain 1-1529 and emal is from strain R-922, the proteins 
encoded by the two genes cannot interact with each other in nature. In an approach designed 
to identify a ferredoxin gene from strain R-922 that is homologous to the fd233 gene and that 
might encode a ferredoxin that interacts optimally with the P450 Ema i, the fd2 3 3 gene was used 
as a hybridization probe to a gene library of DNA from strain R-922. A strongly hybridizing 
cosmid, pPEH232, was identified and the hybridizing DNA was cloned and sequenced. 
Comparison of the deduced amino acid sequences from fd233 and the ferredoxin gene on 
cosmid pPEH232, fd232, revealed that they differed in only a single amino acid. 

In a similar manner, plasmid pTXJA-emal ~fd232 was constructed and tested in S. 
lividans ZX7. This plasmid gave similar results as those obtained with plasmid pTXJA-emal- 
fd233 (see Table 3). The nucleotide and deduced amino acid sequences of fd232 are shown in 
SEQ ID NOs:37 and 38, respectively. 

The emal-fd233 operon was also subcloned, as a Pacl-Pmel fragment, into pTBBKA 
and pEAA that had been digested with the same restriction enzymes. 5. lividans 
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ZX7::pTBBKA-emal-fd233, and S. lividans ZK7::pEAA-emal-fd233 were tested in the 
avermectin conversion assay and found to have higher activities than the strains harboring the 
emal gene alone in the comparable plasmids (see Table 3). 

EXAMPLE XV 

Heterologous Expression of P450K mn i and P450F m «2 in Other Cells 

The expression constructs pRK-emal (Example XHI) and pRK~ema2 (created in a way 
analogous to that described in Example XIII for pKK-emal) were mobilized by conjugation 
into three fluorescent soil Pseudomonas strains. Conjugation was performed according to 
standard methods (Ditta et al, Proc. Natl Acad. Sci. USA 77:7347-7351, 1980). The strains 
were: P. fluorescens MOCG134, P. fluorescens Pf-5, and P. fluorescens CHAO. Standard 
resting cell assays for the conversion of avermectin to 4 ? ' -ket o a vermec ti n were conducted for 
each of the transconjugants. For strains Pf-5 and CHAO, the levels of conversion were below 
the detection limit. Strain MOCG134 yielded 3% conversion for emal and 5% for emal. 

In addition, the constructs listed in the Table 5 were introduced into Streptomyces 
avermitilis MOS-0001 by protoplast-mediated transformation (Kieser, T.; Bibb, M.J.; Buttner, 
M.J.; Chater, K.F.; Hopwood, D.A. (eds.): Practical Streptomyces Genetics. The John Innes 
Foundation, Norwich (England), 2000), (Stutzman-Engwall, K. etal. (1999) Streptomyces 
avermitilis gene directing the ratio of B2:Bi avermectins, WO 99/41389). 



Table 5 



Construct 


% Conversion of avermectin, 16 hrs 


None 


0 


pTBBKA-emai 


10.90 +/- 3.48 


pTVA-emal 


5.326 +/- 2.19 


pEAA-emal 


6.74 +/- 0.08 


pTB&KA-emalA/fd233 


28.50 +/- 0.20 


pTVA-emalA/fd233 


23.97 +/- 5.95 
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Wild-type Str. avermitilis MOS-0001 was tested and found to be incapable of the 
oxidation of avermectin to 4"~ketoavermectin. 

Transformed S. avermitilis strains MOS-0001 : :pTBBKA-<?wai , MOS-0001 (pTUA- 
emal\ MOS-0001::pEAA^?nai, MOS-0001 ::pTBBKA-emalA/fd233, and MOS-0001 
(pTXJA-emalA/fd233) were each tested for their ability to oxidize avermectin to 4"-keto- 
avermectin using resting cells. To do this, the whole cell biocatalysis assay described above 
(including analysis method) was performed. Note that for the whole cell biocatalysis assay, 
transformed Streptomyces avermitilis, like strain R-922, was grown in PHG medium and, 
again like strain R-922, had a reaction time of 16 hours (i.e., during which time the 500 mg 
transformed Streptomyces avermitilis wet cells in 10 ml of 50 mM potassium phosphate 
buffer, pH 7.0, were shaken at 160 rpm at 28°C in the presence of 15 }xl of a solution of 
avermectin in isopropanol (30 mg/ml)). 

As shown in Table 5, in the presence of the inducer, thiostrepton (5 Mg/ml), the emal- or 
emalA/fd233-contmnmg strains MOS-0001 ::pTBBKA-e/nai, MOS-0001 ::pTBBKA- 
emalA/fd233, MOS-0001 (pTVA-emal), MOS-0001 (pTUA-emalA/fd233) were found to 
oxidize avermectin to 4"-keto-avermectin as evidenced by the appearance of the oxidized 4"- 
keto-avermectin compound. Note that the S. avermitilis strain MOS-0001 ::pEAA-emal 
demonstrated this oxidation activity in the absence of thiostrepton since in this strain the emal 
gene is expressed from the ermE promoter that does not require induction. 

Thus, expression of the emal P450 monooxygenase gene in various Streptomyces and 
Pseudomonas strains provided recombinant cells that were able to convert avermectin to 4"- 
ketoavermectin in resting cell assays. 

Next, expression and activity of P450 E mai monooxygenase was tested in E. coli. To do 
this, the emal gene was cloned into the E. coli expression plasmid pET-28b(+) (commercially 
available from Novagen, Madison, WI) as described previously. E. coli strain BL21 DE3 
(commercially available from Invitrogen; Carlsbad, CA) that contains the T7 RNA 
polymerase gene under control of the inducible tac promoter and the pET-28/eraai plasmid 
was cultured in 50 ml LB medium containing 5 mg/1 kanamycin in a 250-ml flask with one 
baffle, for 16 hours at 37°C, with shaking at 130 rpm. 0.5 ml of this culture was used to 
inoculate 500 ml LB medium with 5 mg/1 kanamycin in a 2-liter flask with one baffle, and the 
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culture was incubated for 4 hours at 37°C followed by 4 hours and 30°C, with shaking at 130 
rpm throughout. The cells were harvested by centrifugation, washed in 50 niM potassium 
phosphate buffer, and centrifuged again. 

For the resting cell assays, 90 mg wet cells were weighed into deep-well plates in 
triplicate and resuspended in 0.5 ml 50 mM potassium phosphate buffer. For cell-free 
extracts, 4 grams wet cells in 8 ml disruption buffer were disrupted in French press. 

For the resting cell assays, 5 jxl of substrate (2.5 mg/ml in 2-propanol) was added to the 
cell suspension. The plate was sealed with air permeable foil, and the reaction was incubated 
on an orbital shaker at 1000 rpm at 28°C for 22 hours. No conversion of avermectin to 4"- 
ketoavermectin was detected. 

For the cell-free assays, 100 jlxI cell free extract, l\xl substrate solution (20 mg/ml) in 2- 
propanol, 5 \i\ 100 mM NADPH, 10 \i\ ferredoxin, 10 jul ferredoxin reductase, and 374 \i\ 
potassium phosphate buffer pH 7.0 were added as described in Example III, and the assay was 
incubated at 30°C with shaking at 600 rpm for 20 hours. 9.2% +/- 0.3% of avermectin was 
converted to 4"-ketoavermectin. 

Thus, expression of the emal gene in E. coli resulted in the production of the active 
Emal P450 monooxygenase enzyme which, when purified from the cells, was able to convert 
avermectin to 4"-ketoavermectin. 

EXAMPLE XVI 

Identification and Cloning of Genes Encoding Ferredoxin Reductases that Support Increased 

Activity of the P450R I ng L l Monooxygenase 

The electron transport pathway that supports the activity of P450 monooxygenases also 
includes ferredoxin reductases. These proteins donate electrons to the ferredoxin and, as is 
the case with ferredoxins and P450 monooxygenases, specific ferredoxin reductases are 
known to be better electron donors for certain ferredoxins than others. 

According, a number of ferredoxin reductase genes from Streptomyces strains were 
cloned and were evaluated for their impacts on the biocatalysis reaction. To do this, 
numerous bacterial ferredoxin reductase (Fre) protein sequences were retrieved from NCBI 
and aligned with the program Pretty from the GCG package. Two conserved regions, 



74 



WO 02/092801 



PCT/EP02/05363 



approximately 266 amino acid residues apart, were used to make degenerate oligonucleotides 
for PCR. The forward primer (CGSCCSCCSCTSWSSAAS (SEQ ID NO:96; where "S" is C 
or G; and "W" is A or G)) and the reverse primer (SASSGCSTTSBCCCARTGYTC (SEQ ID 
NO:97; where "S" is C or G; "B" is C, G, or T; "R" is A or G; and T is C or T)) were used 
to amplify 800 bp products from the biocatalytically active Streptomyces strains R-922 and I- 
1529. These pools of products were cloned into TOPO TA cloning vectors (commercially 
available from Invitrogen Inc., Carlsbad, CA), and 20 clones each from R922 and 1-1529 were 
sequenced according to standard methods (see, e.g., Current Protocols in Molecular Biology , 
eds. Ausubel et ah, John Wiley & Sons, Inc. 2000). Sequencing revealed that 4 unique fre 
gene fragments were isolated from the strains: three from R922 (fre3 , fre 1 2, /re 14) and one 
from 1-1529 (frel6). The fre3 , fre 12, fre 1 4, and/rel6 gene fragments were used as probes to 
identify full-length ferredoxin reductases from genomic clone banks of Streptomyces strains 
R922 and 1-1529. By this approach, the complete coding sequence of each of the 4 different 
fre genes was cloned and sequenced. The nucleic acid and amino acid sequences are provided 
as follows: frei (SEQ ID NOs:98 and 99);/rel2 (SEQ ID NOs:100 and 101);/rel4 (SEQ ID 
NOs:102 and 103); and/n?16 (SEQ ID NOs:104 and 105). 

In order to assess the biological activity of each fre gene in relation to the activity of 
Emal, each gene was inserted into the emal/fd233 operon described above, 3' to the fd2 3 3 
gene. This resulted in the formation of artificial operons consisting of the e?nal y fd233, and 
individual fre genes that were expressed from the same promoter. The emal/fd233/fre 
operons were cloned into the Pseudomonas plasmid pRK290 and introduced into 3 different 
P. putida strains. These strains were then analysed for Emal biocatalysis activity using the 
whole cell assay and one of the genes, the fre gene fre 16 from strain 1-1529, was found to 
increase the activity of P450 Em ai monooxygenase by approximately 2-fold. This effect was 
strain specific, as it was seen only in one of the P. putida strains, ATCC Desposit No. 17453, 
and not in the other two. In P. putida strain ATCC 17453, the presence of fre gene /rel6 
resulted in 44% conversion of avermectin to 4"-keto-avermectin, as compared to 23% without 
this gene. The other fre genes had no impact on the biocatalysis activity in any of the P. 
putida strains tested. 

In a similar approach, each of the emal/fd233/fre operons were cloned into the 
Streptomyces plasmids pTUA, pTBBKA, and pEAA, and introduced into 5. lividans strain 
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ZX7. In each case there was no impact in 5. lividans by any of the fre genes on biocatalysis 
activity. 

EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain, using no more than 
routine experimentation, numerous equivalents to the specific substances and procedures 
described herein. Such equivalents are considered to be within the scope of this invention. 

The patent and scientific literature referred to herein establishes knowledge that is 
available to those with skill in the art. The issued patents, applications, and references, 
including GenBank database sequences, that are cited herein are hereby incorporated by 
reference to the same extent as if each was specifically and individually indicated to be 
incorporated by reference. Any conflict between any reference cited herein and the specific 
teachings of this specification shall be resolved in favor of the latter. Likewise, any conflict 
between an art-understood definition of a word or phrase and a definition of the word or 
phrase as specifically taught in this specification shall be resolved in favor of the latter. 
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What is claimed is: 



1. A purified nucleic acid molecule encoding a polypeptide that exhibits an enzymatic 

activity of a P450 monooxygenase and is capable of regioselectively oxidizing the alcohol 
at position 4" of a compound of formular (II) 



HO 




CH 2 R 6 



wherein 

Ri-R 7 represent, independently of each other hydrogen or a substituent; 

m is 0, 1 or 2; 

n is 0, 1, 2 or 3; and 

the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two 
adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a 
epoxide bridge of the formula 



H O H 



, or a single bond and a methylene bridge of the formula 
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H 




including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer 
thereof, in each case in free form or in salt form, 
in order to produce a compound of the formula (HI) 



R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (II) above. 

The nucleic acid molecule of claim 1 , comprising a nucleic acid sequence that encodes a 
polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and 
regioselectively oxidizes avermectin to 4"keto-avermectin. 

The nucleic acid molecule of claims 1 or 2, comprising a nucleic acid sequence that 
encodes a polypeptide that exhibits an enzymatic activity of a P450 monooxygenase, 
which polypeptide is substantially similar, and has between at least 50%, and 99% amino 
acid sequence identity to the polypeptide of SEQ ID NO:2. 




R7 



wherein 
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The nucleic acid molecule of claim 3 comprising a nucleotide sequence 

a) as given in SEQ ID NO:l; 

b) having substantial similarity to (a); 

c) capable of hybridizing to (a) or the complement thereof; 

d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO:l, or the 
complement thereof; 

e) complementary to (a), (b) or (c); 

f) which is the reverse complement of (a), (b) or (c); or 

g) which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that still 
exhibits an enzymatic activity of a P450 monooxygenase and regioselectively oxidizes 
avermectin to 4"~keto-avermectin. . 

5. The nucleic acid molecule of claims 1 or 2, comprising a nucleic acid sequence that is 
at least 66 % identical to SEQ ID NO:l. 

6. The nucleic acid molecule of claims 1 or 2, comprising a nucleic acid sequence that 
encodes a polypeptide that exhibits an enzymatic activity of a P450 monooxygenase, 
which polypeptide is substantially similar, and has at least between 60%, and 99% 
amino acid sequence identity to the polypeptide of SEQ ID NO:2, SEQ ID NO:4, SEQ 
ID NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO: 12, SEQ ID NO:14, SEQ ID 
NO: 16, SEQ ID NO: 18, SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID 
NO:26, SEQ ID NO:28, SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID 
NO:95. 

7. The nucleic acid molecule of claims 1 or 2, comprising a nucleic acid sequence that 
encodes a polypeptide that exhibits an enzymatic activity of a P450 
monooxygenase, which polypeptide is immunologically reactive with antibodies raised 
against a polypeptide of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, 
SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16, SEQ ID NO:18, 
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SEQ ID NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, 
SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, or SEQ ID NO:95. 



8. The nucleic acid molecule of claims 1 or 2 comprising a nucleotide sequence 

a) as given in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 
NO:9, SEQ ID NO: 11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID 
NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID NO:94; 

b) having substantial similarity to (a); 

c) capable of hybridizing to (a) or the complement thereof; 

d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO: 11, SEQ ID NO: 13, 
SEQ ID NO: 15, SEQ ID NO: 17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, 
SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or 
SEQ ID NO:94 or the complement thereof; 

e) complementary to (a), (b) or (c); and 

f) which is the reverse complement of (a), (b) or (c). 

g) . which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that 
still exhibits an enzymatic activity of a P450 monooxygenase and regioselectively 
oxidizes avermectin to 4"-keto-avermectin. 



9. The nucleic acid molecule of claim 8, comprising a nucleic acid sequence selected from 
the group consisting of SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ 
ID NO:9, SEQ ID NO:ll, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID 
NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
NO:29, SEQ ID NO:31, SEQ ID NO:33, and SEQ ID NO:94. 

10. The nucleic acid molecule of anyone of claims 1 to 9, wherein the nucleic acid molecule 
is isolated from a Streptomyces strain. 
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11. The nucleic acid molecule of anyone of claims 1 to 10 further comprising a nucleic acid 
sequence encoding a tag which is linked to the P450 monooxygenase via a covalent bond. 

12. A polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and is 
capable of regioselectively oxidizing the alcohol at position 4" of a compound of formular 



wherein 

R1-R7 represent, independently of each other hydrogen or a substituent; 

m is 0, 1 or 2; 

n is 0, 1, 2 or 3; and 

the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two 
adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a 
epoxide bridge of the formula 





, or a single bond and a methylene bridge of the formula 
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including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer 
thereof, in each case in free form or in salt form, 
in order to produce a compound of the formula (HI) 




OH), 



wherein 

R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (II) above. 

13. A polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and 
regioselectively oxidizes avermectin to 4"keto-avermectin. 

14. The polypeptide of claims 12 or 13 that comprises an amino acid sequence that is 
encoded by a nucleic acid molecule 

a) as given in SEQ ID NO:l or the complement thereof; 

b) having substantial similarity to (a); 

c) capable of hybridizing to (a) or the complement thereof; 
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d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO:l, or the 
complement thereof; 

e) complementary to (a), (b) or (c); 

5 f) which is the reverse complement of (a), (b) or (c); or 

g) which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that still 
exhibits an enzymatic activity of a P450 monooxygenase and regioselectively oxidizes 
avermectin to 4"-keto-avermectin. 

10 15. The polypeptide of claims 12 to 14, comprising an amino acid sequence that is at least 
50% identical to SEQ ID NO:2. 

16. The polypeptide of claims 12 or 13 comprising an amino acid sequence that is encoded by 
a nucleic acid molecule 

15 a) as given in SEQ ID NO:l, SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID 

NO:9, SEQ ID NO:ll, SEQ ID NO:13, SEQ ID NO:15, SEQ ID NO:17, SEQ ID 
NO: 19, SEQ ID NO:21, SEQ ID NO:23, SEQ ID NO:25, SEQ ID NO:27, SEQ ID 
NO:29, SEQ ID NO:31, SEQ ID NO:33, or SEQ ID NO:94 or the complement thereof; 
b) having substantial similarity to (a); 

20 c) capable of hybridizing to (a) or the complement thereof; 

d) capable of hybridizing to a nucleic acid molecule comprising 50 to 200 or more 
consecutive nucleotides of a nucleotide sequence given in SEQ ID NO:l, SEQ ID 
NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:l 1, SEQ ID NO:13, 
SEQ ID NO:15, SEQ ID NO:17, SEQ ID NO:19, SEQ ID NO:21, SEQ ID NO:23, 

25 SEQ ID NO:25, SEQ ID NO:27, SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, or 

SEQ ID NO: 94 or the complement thereof, or the complement thereof; 

e) complementary to (a), (b) or (c); 

f) which is the reverse complement of (a), (b) or (c); or 
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g) which is a functional part of (a), (b), (c), (d), (e) or (f) encoding a polypeptide that still 
exhibits an enzymatic activity of a P450 monooxygenase and regioselectively oxidizes 
avermectin to 4"-keto-avermectin. 

17. The polypeptide of claim 16, comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6 5 SEQ ID NO:8, SEQ ID 
NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16, SEQ ID NO: 18, SEQ ED 
NO:20, SEQ ID NO:22, SEQ ID NO:24, SEQ ID NO:26, SEQ ID NO:28, SEQ ID 
NO:30, SEQ ID NO:32, SEQ ID NO:34, and SEQ ID NO:95. 



18. The polypeptide of anyone of claims 12 to 17, further comprising a tag. 

19. A binding agent that specifically binds to the polypeptide of anyone of claims 12 to 18. 

15 20. The binding agent of claim 20, wherein the binding agent is an antibody. 

21. A family of polypeptides exhibiting an enzymatic activity of a P450 monooxygenase, 
wherein each member of the family is capable of regioselectively oxidizing the alcohol at 
position 4" of a compound of formular (II) 

20 
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HO, 




R5 (n); 



13 
R4 



R7 



wherein 

R1-R7 represent, independently of each other hydrogen or a substituent; 

m is 0, 1 or 2; 

n is 0, 1, 2 or 3; and 

the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two 
adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a 
epoxide bridge of the formula 



including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer 
thereof, in each case in free form or in salt form, 
in order to produce a compound of the formula (EI) 




, or a single bond and a methylene bridge of the formula 
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wherein 

R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (II) above. 

22. A family of polypeptides exhibiting an enzymatic activity of a P450 monooxygenase, 
wherein each member of the family oxidizes avermectin to 4"keto-av ermectin . 

23. The family of claims 21 or 22, wherein each member of the family is comprises an amino 
acid sequence that is at least 50% identical to SEQ ID NO:2. 

24. A purified nucleic acid molecule comprising a nucleotide sequence encoding a 
polypeptide exhibiting an enzymatic activity of a ferredoxin and a ferredoxin 
reductase,respectively, wherein the nucleic acid molecule is isolated from a Streptomyces 
strain comprising a P450 monooxygenase that is capable of regioselectively oxidizing the 
alcohol at position 4" of a compound of formular (II) 
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R5 (ID; 



wherein 

R1-R7 represent, independently of each other hydrogen or a substituent; 

m is 0, 1 or 2; 

n is 0, 1, 2 or 3; and 

the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two 
adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a 
epoxide bridge of the formula 



H O H 



, or a single bond and a methylene bridge of the formula 



including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer 
thereof, in each case in free form or in salt form, 
in order to produce a compound of the formula (IE) 
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. m 




(Hi), 



wherein 



R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (II) above. 

25. A purified nucleic acid molecule according to claim 24 comprising a nucleotide sequence 
encoding a polypeptide exhibiting an enzymatic activity of a ferredoxin and a ferredoxin 
reductase, respectively, wherein the nucleic acid molecule is isolated from a 
Streptomyces strain comprising a P450 monooxygenase that regioselectively oxidizes 
avermectin to 4"keto-avermectin. 

26. The nucleic acid molecule of claim 25, comprising a nucleic acid sequence selected from 
the group consisting of SEQ ID NO:35 and SEQ ID NO:37. 

27. The nucleic acid molecule of claim 25, comprising a nucleic acid sequence selected from 
the group consisting of SEQ ID NO:98, SEQ ID NO: 100, SEQ ID NO: 102, and SEQ ID 



28. A polypeptide exhibiting an enzymatic activity of a ferredoxin and a ferredoxin reductase, 
respectively, wherein the polypeptide is isolated from a Streptomyces strain comprising a 



NO: 104. 
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P450 monooxygenase that is capable of regioselectively oxidizing the alcohol at position 
4" of a compound of formular (II) 



R1-R7 represent, independently of each other hydrogen or a substituent; 

m is 0, 1 or 2; 

n is 0, 1, 2 or 3; and 

the bonds marked with A, B, C, D, E and F indicate, independently of each other, that two 
adjacent carbon atoms are connected by a double bond, a single bond, a single bond and a 
epoxide bridge of the formula 




wherein 




, or a single bond and a methylene bridge of the formula 
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including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer 
thereof, in each case in free form or in salt form, 
in order to produce a compound of the formula (HI) 




wherein 

R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (II) above. 

29. A polypeptide exhibiting an enzymatic activity of a ferredoxin and a ferredoxin reductase, 
respectively, wherein the ferredoxin protein is isolated from a Streptomyces strain 
comprising a P450 monooxygenase that regioselectively oxidizes avermectin to 4"keto- 
avermectin. 

30. The ferredoxin protein of claim 29, comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO:36 and SEQ ID NO:38. 

31. The ferredoxin reductase protein of claim 29, comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:99, SEQ ID NO: 101 , SEQ ID NO: 103, 
and SEQ ID NO: 105. 
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32. A cell genetically engineered to comprise a nucleic acid molecule encoding a polypeptide 
that exhibits an enzymatic activity of a P450 monooxygenase according to anyone of 
claims 1 to 11. 

33. The cell of claim 32 further comprising a nucleic acid molecule encoding a ferredoxin 
protein and a ferredoxin reductrase protein, respectively, or a combination thereof. 

34. The cell of claims 32 or 33, wherein the nucleic acid molecule is positioned for 
expression in the cell. 

35. The cell of anyone of claims 32 to 34, wherein the cell is a genetically engineered cell 
selected from the group consisting of a Streptomyces strain cell and a Pseudomona strain 
cell, and an Escherichia coli strain cell. 

36. The cell of claim 35, wherein the cell has NRRL Designation No. B-30478 and NRRL 
Designation No.B-30479, respectively. 

37. A method for the preparation a compound of the formula 




R7 
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in which 

R1-R9 represent, independently of each other hydrogen or a substituent; 
m is 0, 1 or 2; 

n is 0, 1, 2 or 3; and 

the bonds marked with A, B, C, D, E and F indicate, independently of each other, that 
two adjacent carbon atoms are connected by a double bond, a single bond, a single bond 
and a epoxide bridge of the formula 



including, where applicable, an E/Z isomer, a mixture of E/Z isomers, and/or a tautomer 

thereof, in each case in free form or in salt form, 

which process comprises 

1) bringing a compound of the formula 




, or a single bond and a methylene bridge of the formula 





R7 
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wherein 

R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, 
into contact with a polypeptide according to the invention that is capable of 
regioselectively oxidising the alcohol at position 4" in order to form a compound of the 
formula 



in which Ri, R 2 , R3, R4 ? R5, R6, R7, m, n, A, B, C, D, E and F have the meanings given 
for formula (I); and 

2) reacting the compound of the formula (HI) with an amine of the formula HN(R 8 )Rc>, 
wherein R 8 and R9 have the same meanings as given for formula (I), and which is 
known, in the presence of a reducing agent; 

and, in each case, if desired, converting a compound of formula (I) obtainable in 
accordance with the process or by another method, or an E/Z isomer or tautomer thereof, 
in each case in free form or in salt form, into a different compound of formula (I) or an 
E/Z isomer or tautomer thereof, in each case in free form or in salt form, separating a 
mixture of E/Z isomers obtainable in accordance with the process and isolating the 
desired isomer, and/or converting a free compound of formula (I) obtainable in 
accordance with the process or by another method, or an E/Z isomer or tautomer thereof, 
into a salt or converting a salt, obtainable in accordance with the process or by another 



O 




R7 
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method, of a compound of formula (I) or of an E/Z isomer or tautomer thereof into the 
free compound of formula (I) or an E/Z isomer or tautomer thereof or into a different 
salt. 

38. A method for the preparation of a compound of the formula 




(m), 



in which Ri, R 2 , R3, R4, R5, Re, R7, m, n, A, B, C, D, E and F have the meanings given for 
formula (EI) of claim 37, 
which process comprises 

1) bringing a compound of the formula 
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R5 (D); 



13 
R4 



wherein 

R1-R7, m, n, A, B, C, D, E and F have the same meanings as given for formula (I) above, 
into contact with a polypeptide according to the invention that is capable of 
regioselectively oxidising the alcohol at position 4", maintaining said contact for a time 
sufficient for the oxidation reaction to occur and isolating and purifying the compound 
of formula (II). 

39. A method according to anyone of claims 37 or 38 for making emamectin, comprising 
adding a polypeptide that exhibits an enzymatic activity of a P450 monooxygenase and 
regioselectively oxidizes avermectin to 4"keto-avermectin to a reaction mixture 
comprising avermectin and incubating the reaction mixture under conditions that allow 
the polypeptide to regioselectively oxidize avermectin to 4"keto-avermectin. 

40. The method of anyone of claims 37 to 39, wherein the reaction mixture further comprises 
a ferredoxin protein. 

41. The method of anyone of claims 37 to 40, wherein the reaction mixture further comprises 
a ferredoxin reductase protein. 
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42. A formulation for making eraamectin comprising a polypeptide that exhibits an 
enzymatic activity of a P450 monooxygenase and regioselectively oxidizes avermectin to 
4"keto-avermectin. 

5 

43. The formulation of claims 42 further comprising a ferredoxin protein. 



44. The formulation of claim 42 or 43 further comprising a ferredoxin reductase protein. 
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Figure 2 
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Figure 3 
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SEQUENCE LISTING 
<110> Syngenta Participations AG 

<120> METHODS AND COMPOSITIONS FOR MAKING EMAMECTIN 

<130> PB/5-60016A 

<140> 
<141> 

<150> US 60/291,149 
<151> 2001-05-16 

<160> 105 

<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 1293 

<212> DNA 

<213> Streptomyces tuber ci di cus 



<400> 1 

atgtcggaat taatgaactc tccgttcgcc gcgcacgtcg ggaaacaccc gggcgagccg 60 

aatgtgatgg accccgccct gatcaccgac ccgttcaccg gctacggcgc gctgcgtgag 120 

cagggcccgg tcgtacgggg ccggttcatg gacgactcgc ccgtctggct ggtgacgcgg 180 

ttcgaggagg tccgccaggt cctgcgcgac cagcggttcg .tgaacaatcc ggcctcgccg 240 

tccctgaact acgcgcccga ggacaacccg ctgacccggc tgatggagat gctgggcctc 300 

cccgagcacc tccgcgtcta cctgctcgga tcgatcctca actacgacgc ccccgaccac 360 

acccggctgc gccgtctggt gtcgcgggcg ttcacggccc gcaagatcac cgacctgcgg 420 

ccccgggtcg agcagatcgc cgacgcgctg ctggcccggc tgcccgagca cgccgaggac 480 

ggcgtcgtcg acctcatcca gcacttcgcc taccccctgc cgatcaccgt catctgcgaa 540 

ctggtcggca tacccgaagc ggaccgcccg cagtggcgaa cgtggggcgc cgacctcatc 600 

tcgatggatc cggaccggct cggcgcctcg ttcccggcga tgatcgagca catccatcag 660 

atggtccggg aacggcgcga ggcgctcacc gacgacctgc tcagcgaact gatccgcacc 720 

catgacgacg acggcgggcg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780 

gtcctcgccg gccacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840 

acccaccccg accagctgcg tctggtcaag gacgatccgg ccctcctccc ccgtgccgtc 900 

cacgagctga tgcgctggtg cgggccggtg cacatgaccc agctgcgcta cgccaccgcc 960 

gacgtcgacc tcgccggcac accgatccgc cagggcgatg ccgttcaact catcctggta 1020 

tcggccaact tcgacccccg tcactacacc gaccccgacc gcctcgatct cacccggcac 1080 

cccgcgggcc acgccgagaa ecatgtgggt ttcggccatg gagcgcacta ctgcctgggc 1140 

gccacactcg ccaaacagga aggtgaagtc gccttcggca aactgctcac gcactacccg 1200 

gacatatcgc tgggcatcgc cccggaacac ctggagcgga caccgctgcc gggcaactgg 1260 

cggctgaact cgctgccggt gcggttgggg tga 1293 



<210> 2 

<211> 430 

<212> PRT 

<213> Streptomyces tubercidicus 



- 1 - 
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<400> 2 

Met Ser Glu Leu Met Asn Ser Pro Phe Ala Ala His Val Gly Lys His 

15 10 15 

Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu lie Thr Asp Pro Phe 

20 25 30 

Thx Gly Tyr Gly Ala Leu Arg Glu Gin Gly Pro Val Val Arg Gly Arg 

35 40 45 

Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val 

50 55 60 

Arg Gin Val Leu Arg Asp Gin Arg Phe Val Asn Asn Pro Ala Ser Pro 
65 70 75 80 

Ser Leu Asn Tyr Ala Pro Glu Asp Asn Pro Leu Thr Arg Leu Met Glu 

85 90 95 

Met Leu Gly Leu Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser lie 

100 105 110 

Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser 

115 120 125 

Arg Ala Phe Thr Ala Arg Lys lie Thr Asp Leu Arg Pro Arg Val Glu 

130 135 140 

Gin lie Ala Asp Ala Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp 
145 150 155 160 

Gly Val Val Asp Leu Jle Gin His Phe Ala Tyr Pro Leu Pro Jle Thr 

165 170 ~ 175 

Val lie Cys Glu Leu Val Gly lie Pro Glu Ala Asp Arg Pro Gin Trp 

180 185 190 

Arg Thr Trp Gly Ala Asp Leu lie Ser Met Asp Pro Asp Arg Leu Gly 

195 200 205 

Ala Ser Phe Pro Ala Met lie Glu His lie His Gin Met Val Arg Glu 

210 215 220 

Arg Arg Glu Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu lie Arg Thr 
225 230 235 240 

His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Met 

245 250 255 

lie Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu lie 

260 265 270 

Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gin Leu Arg Leu 

275 280 285 

Val Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met 

290 295 300 

Arg Trp Cys Gly Pro Val His Met Thr Gin Leu Arg Tyr Ala Thr Ala 
305 310 315 320 

Asp Val Asp Leu Ala Gly Thr Pro He Arg Gin Gly Asp Ala Val Gin 

325 330 335 

Leu He Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro 

340 345 350 

Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His 

355 360 365 

Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala 

370 375 380 

Lys Gin Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Thr His Tyr Pro 
385 390 395 400 

Asp He Ser Leu Gly He Ala Pro Glu His Leu Glu Arg Thr Pro Leu 

405 410 415 

Pro Gly Asn Trp Arg Leu Asn Ser Leu Pro Val Arg Leu Gly 
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420 425 430 



<210> 3 
<211> 1293 
<212> DNA 

<213> Streptoinyces tubercidicus 
<400> 3 

atgtcggcat tatccagctc tccgttcgct gcgcatgtcg ggaaacaccc gggtgagccg 60 

aatgtgatgg agccggcgct gctcaccgac ccgttcgcgg gctacggcgc gctgcgtgag 120 

caggccccgg tcgtacgggg ccggttcgtg gacgactcac cggtctggtt cgtgacgcgc 180 

ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggccgcgccg 240 

cccctggccc catcggccga ggagaacccg ctgaccaggc tgatggacat gctgggcctc 300 

cccgagcacc tccgcgtcta catgctcggg tcgattctca actacgacgc ccccgaccac 360 

acccggctgc gccgtctggt gtcgcgcgcg ttcacggcgc ggaagatcac cgatctgcga 420 

ccgcgtgtcg agcagatcgc cgacgagctg ctggcccgcc tccccgagta cgccgaggac 480 

ggcgtcgtcg acctcatcca gcatttcgcc tacccgctgc cgatcaccgt catctgcgag 540 

ctggtcggca tacccgaagc ggaccgcccg cagtggcgga agtggggcgc cgacctcatc 600 

tcgatggacc cggaccggct cggcgcaacg ttcccggcga tgatcgagca catccatgag 660 

atggtccggg agcggcgcgc ggcgctcacc gatgatctgc tcagcgagct gatccgtacc 720 

catgacgacg atggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780 

gtcctcgccg gtcacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840 

acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc ccgggccgtc 900 

catgaactga tgcgctggtg cgggccggtg cagatgacgc agctgcgcta cgcggccgcc 960 

gacgtcgacc tcgccggtac gcggatccac aagggcgacg ccgtacaact cctcctggtt 1020 

gcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgatct gacgcgtcac 1080 

cccgccggcc acgccgagaa ccatgtgggt ttcggccacg gtgcgcatta ctgcctgggt 1140 

gccaccctcg ccaagcagga gggcgaagtc gcgttcggca agctgctcgc gcactacccg 1200 

gagatgtccc tgggcatcga accggaacgt ctggagcgat tgccgctgcc tggcaactgg 1260 

cggctgaatt ccctgccgtt gcggctgggg tga 1293 



<210> 4 

<211> 430 

<212> PRT 

<213> Streptoinyces tubercidicus 



<400> 4 



Met 


Ser 


Ala 


Leu 


Ser 


Ser 


Ser 


Pro 


Phe 


Ala Ala His Val Gly Lys His 


1 








5 










10 15 


Pro 


Gly 


Glu 


Pro 


Asn 


Val 


Met 


Glu 


Pro 


Ala Leu Leu Thr Asp Pro Phe 








20 










25 


30 


Ala 


Gly 


Tyr 


Gly 


Ala 


Leu 


Arg 


Glu 


Gin 


Ala Pro Val Val Arg Gly Arg 






35 










40 




45 


Phe 


Val 


Asp 


Asp 


Ser 


Pro 


Val 


Trp 


Phe 


Val Thr Arg Phe Glu Glu Val 




50 










55 






60 


Arg 


Gin 


Val 


Leu 


Arg 


Asp 


Gin 


Arg 


Phe 


Val Asn Asn Pro Ala Ala Pro 


65 










70 








75 80 


Pro 


Leu 


Ala 


Pro 


Ser 


Ala 


Glu 


Glu 


Asn 


Pro Leu Thr Arg Leu Met Asp 










85 










90 95 


Met 


Leu 


Gly 


Leu 


Pro 


Glu 


His 


Leu 


Arg 


Val Tyr Met Leu Gly Ser He 








100 










105 


110 


Leu 


Asn 


Tyr 


Asp 


Ala 


Pro 


Asp 


His 


Thr 


Arg Leu Arg Arg Leu Val Ser 
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115 




120 


Tktct 


Ala 


Phe 


Thr 


Ala Arc Lvs Tie 




130 






135 


Gin 


lie 


Ala 


As]p 


Glu Leu Leu Ala 


145 








150 


Gly 


Val 


Val 


Asp 


Leu He Gin His 










165 


Val 


lie 


Cys 


Glu 


Leu Val Glv He 








180 




Arg 


Lys 


Tro 


Gly 


Al a A^'o Lou Tie 






195 




200 


Ala 


Thr 


Phe 


Pro 


Ala Met He Glu 




210 






215 




Arcr 


Ala 


Ala 


T,oi i Thr~ A*^T) Asd 


225 








230 


His 


Asp 




Asp 


(21 \z (21 \/ 7\t~ci T ,pi i 










245 


Tl c=> 


JLtCU. 


J. ±J_L 


T .oi i 

J_lt=Cl 


Val Lou Ala Glv 








260 






A cm 




JL x J-£_ 


S] a Ala T ,P1 1 T iOI 1 






97R 
/ «j 




Z. O 


T »oi i 


j-jys 






p-rn A 1 pi T .oi i T .oi i 




290 






295 




ix p 




CXI v 


p rn VpjI (21 n Mof- 

IrxU VCtX or_LA± l/lts L» 


305 








310 




V d_L 




T .oi i 


7\1 a (~l~\\r TVit* Aytt 

o_LCt VjX^ X1JJL riiy 










~> z. -J 


LeU 




T .oi i 


Val 


Alp* Al pi Aqn T^Vio 
r\-L cJ. .rULCl ritoli JTllti 








J> f± VJ 








T .oi i 


Sen 


T,pn T'Vit" Ar*rr T-Tn c> 

J_J^—CL 1111 rVLy 1 J.-L »3> 






355 




360 


Val 


Gly 


Phe 


Gly 


His Gly Ala His 




370 






375 


Lys 


Gin 


Glu 


Gly 


Glu Val Ala Phe 


385 








390 


Glu 


Met 


Ser 


Leu 


Gly He Glu Pro 










405 


Pro 


Gly 


Asn 


Trp 


Arg Leu Asn Ser 



420 



125 



Thr 


Asp 


Leu 


Arg 


Pro 


Arg 


Val 


Glu 








140 










Arg 


Leu 


Pro 


Glu 


Tvr 


Ala 


Glu 


Asp 






155 










160 


Phe 


Ala 


Tvr 


Pro 


Leu 


Pro 


He 


Thr 




170 










175 




Pro 


Glu 


Ala 


Asp 


Arg 


Pro 


Gin 


Tro 


185 










190 






Ser 


Met 


Asp 


Pro 


Asp 


Arg 


Leu 


Glv 










205 








His 


He 


His 


Glu 


Met 


Val 


Arg 


Glu 








220 










Leu 


Leu 


Ser 


Glu 


Leu 


He 


Arg 


Thr 






235 










240 


Ser 


Asp 


Val 


Glu 


Met 


Val 


Thr 


Met 




250 










255 




His 


Glu 


Thr 


Thr 


Ala 


His 


Leu 


He 


265 










270 






Thr 


His 


Pro 


Asp 


Gin 


Leu 


Arg 


Leu 










285 








Pro 


Arg 


Ala 


Val 


His 


Glu 


Leu 


Met 








300 










Thr 


Gin 


Leu 


Arg 


Tvr 


Ala 


Ala 


Ala 






315 










320 


He 


His 


Lys 


Gly 


Asp 


Ala 


Val 


Gin 




330 










335 




Asp 


Pro 


Arg 


His 


Tvr 


Thr 


Asp 


Pro 


345 










350 






Pro 


Ala 


Glv 


His 


Ala 


Glu 


Asn 


His 










~J \J ~J 








Tyr 


cys 


Leu 


Gly 


Ala 


Thr 


Leu 


Ala 








380 










Gly 


Lys 


Leu 


Leu 


Ala 


His 


Tyr 


Pro 






395 










400 


Glu 


Arg 


Leu 


Glu 


Arg 


Leu 


Pro 


Leu 




410 










415 




Leu 


Pro 


Leu 


Arg 


Leu 


Gly 






425 










430 







<210> 5 
<211> 1413 
<212> DNA 

<213> Streptomyces rimosus 
<400> 5 

atgaccacat cgcccaccga gtcccgggcg gccaccccgc ccgactccac cgcctccccc 60 
tcgaccgctt ccgccccggc caccacccct tcggccgccg cctctccgga caccaccgac 120 
cgca.cca.cgc tcccctccta cgtcggcctc cacccgggcg agccgaacct gatggaaccg 180 
gagctgetgg agaaccegta caccggctac ggcacgctgc gegagcagge cccgctcgtc 240 
cgcgcccggt tcatcgacga ctcgcccatc tggctggtga cccgcttcga cgtggtgcgc 300 
gaggtgatgc gtgaccagcg gttegtcaac aacccgaccc tggtgcccgg catcggcgcg 360 
gacaaggacc cgcgtgcccg gctgatcgag ctgttcggca tccccgagga cctggccccg 420 
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tacctcaccg acaacatcct caccagcgac ccgccggacc acacccggct gcgccgcctg 480 

gtctcccgcg ccttcaccgc acgccgtatc caggacctgc ggccgcgcgt cgagcggatc 540 

accgacgagc tgctggaacg gctgccggac catgccgagg acggcgtcgt cgacctcgtc 600 

gagcacttcg cctacccgct gcccatcacg gtcatctgcg agctggtcgg catcgacgag 660 

gaggatcggg cgctgtggcg gcggttcggc gccgacctcg cctcgctgaa ccccaagcgc 720 

atcggcgcca ccatgccgga gatgatctcg cacatccacg agctgatcga cgaacggcgc 780 

gcggccctgc gggacgacct gctcagcggg ctcatccggg cgcaggacga cgacggcggc 840 

cggctgagcg acgtcgagat ggtcaccctg gtcctgaccc tggtactggc cggtcacgag 900 

accaccgccc acctcatcag caacggcacc ctcgccctgc tcacccaccc cgaccagcgg 960 

cggctgatcg acgaggaccc ggcgctgctg ccgcgcgcgg tccacgagct gatgcgctgg 1020 

tgcgggccga tccaggccac ccagcttcgg tacgccctgg aggacaccga ggtggccgga 1080 

gtccaggtcc gccagggcga ggccctgatg ttcagcctcg tcgcggccaa ccacgacccg 1140 

cgccactaca ccgggccgga gcggctcgac ctgacgcggc agccggccgg ccgcgccgag 1200 

gaccacgtcg gcttcggcca cggcatgcac tactgcctgg gtgcctcact cgcccggcag 1260 

gaggccgagg tggcctacgg gaagctgctc acccgctacc cggacctggc gctcgccctc 1320 

accccggaac agttggagga ccaggaacgc ctgcggcagc ccggcacctg gcgcctgcga 1380 

cggctgccgc tgaggctgca cgcgcagagc tga 1413 



<210> 6 

<211> 470 

<212> PRT 

<213> Strep tomyces rimosus 



<400> 6 



Met 


Thr 


Thr 


Ser 


Pro 


Thr 


Glu 


Ser 


Arg 


Ala Ala Thr Pro Pro Asp Ser 


1 








5 










10 15 


Thr 


Ala 


Ser 


Pro 


Ser 


Thr 


Ala 


Ser 


Ala 


Pro Ala Thr Thr Pro Ser Ala 








20 










25 


30 


Ala 


Ala 


Ser 


Pro 


Asp 


Thr 


Thr 


Asp 


Arg 


Thr Thr Leu Pro Ser Tyr Val 






35 










40 




45 


Gly 


Leu 


His 


Pro 


Gly 


Glu 


Pro 


Asn 


Leu 


Met Glu Pro Glu Leu Leu Glu 




50 










55 






60 


Asn 


Pro 


Tyr 


Thr 


Gly 


Tyr 


Gly 


Thr 


Leu 


Arg Glu Gin Ala Pro Leu Val 


65 










70 








75 80 


Arg 


Ala 


Arg 


Phe 


lie 


Asp 


Asp 


Ser 


Pro 


He Trp Leu Val Thr Arg Phe 










85 










90 95 


Asp 


Val 


Val 


Arg 


Glu 


Val 


Met 


Arg 


Asp 


Gin Arg Phe Val Asn Asn Pro 








100 










105 


110 


Thr 


Leu 


Val 


Pro 


Gly 


lie 


Gly 


Ala 


Asp 


Lys Asp Pro Arg Ala Arg Leu 






115 










120 




125 


lie 


Glu 


Leu 


Phe 


Gly 


lie 


Pro 


Glu 


Asp 


Leu Ala Pro Tyr Leu Thr Asp 




130 










135 






140 


Asn 


lie 


Leu 


Thr 


Ser 


Asp 


Pro 


Pro 


Asp 


His Thr Arg Leu Arg Arg Leu 


145 










150 








155 160 


Val 


Ser 


Arg 


Ala 


Phe 


Thr 


Ala 


Arg 


Arg 


He Gin Asp Leu Arg Pro Arg 










165 










170 175 


Val 


Glu 


Arg 


He 


Thr 


Asp 


Glu 


Leu 


Leu 


Glu Arg Leu Pro Asp His Ala 








180 










185 


190 


Glu 


Asp 


Gly 


Val 


Val 


Asp 


Leu 


Val 


Glu 


His Phe Ala Tyr Pro Leu Pro 






195 










200 




205 


lie 


Thr 


Val 


He 


Cys 


Glu 


Leu 


Val 


Gly 


He Asp Glu Glu Asp Arg Ala 




210 










215 






220 


Leu 


Trp 


Arg 


Arg 


Phe 


Gly 


Ala 


Asp 


Leu 


Ala Ser Leu Asn Pro Lys Arg 
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225 






230 




235 


240 


He 


Gly 


Ala Thr 


Met Pro 


Glu 


Met lie Ser His He His 


Glu Leu He 








245 




250 


255 


Asp 


Glu 


Arg Arg 


Ala Ala 


Leu 


Arg Asp Asp Leu Leu Ser 


Gly Leu He 






260 






265 


270 


Arg 


Ala 


Gin Asp 


Asp Asp 


Gly 


Gly Arg Leu Ser Asp Val 


Glu Met Val 






275 






280 285 




Thr 


Leu 


Val Leu 


Thr Leu 


Val 


Leu Ala Gly His Glu Thr 


Thr Ala His 




290 






295 


300 




Leu 


He 


Ser Asn 


Gly Thr 


Leu 


Ala Leu Leu Thr His Pro 


Asp Gin Arg 


305 






310 




315 


320 


Arg 


Leu 


He Asp 


Glu Asp 


Pro 


Ala Leu Leu Pro Arg Ala 


Val His Glu 








325 




330 


335 


Leu 


Met 


Arg Trp 


Cys Gly 


Pro 


He Gin Ala Thr Gin Leu 


Arg Tyr Ala 






340 






345 


350 


Leu 


Glu 


Asp Thr 


Glu Val 


Ala 


Gly Val Gin Val Arg Gin 


Glv Glu Ala 






355 






360 365 




Leu 


Met 


Phe Ser 


Leu Val 


Ala 


Ala Asn His Asp Pro Arg 


Hi s Tyr Thr 




370 






375 


380 




Gly 


Pro 


Glu Arg 


Leu Asp 


Leu 


Thr Arg Gin Pro Ala Gly 


Arg Ala Glu 


385 






390 




395 


400 


AsiD 


His 


Val Gly 


Phe Gly 


His 


Glv Met His Tvr Cvs Leu 


Glv Ala Ser 

\-J_L._Jf UCJ. 








405 




410 


415 


Leu 


Ala 


Arcr Gin 


Glu Ala 


Glu 




T.can r PVi"K* A rr 
J— it; Li J_±J_L iTJLy 






420 






425 


430 


Tyr 


Pro 


Asp Leu 


Ala Leu 


Ala 


Leu Thr Pro Glu Gin Leu 


Glu Asp Gin 






435 






440 445 




Glu 


Arg 


Leu Arg 


Gin Pro 


Gly 


Thr Trp Arg Leu Arg Arg 


Leu Pro Leu 




450 






455 


460 




Arg 


Leu 


His Ala 


Gin Ser 








465 






470 









<210> 7 
<211> 1293 
<212> DNA 

<213> Streptoiryces lydicus 
<400> 7 

atgtcggcat cacccagcaa cacgttcacc 
aacgtgatgg atccggcgct gatcggggat 
cagggcccgg tcgtgcgggg gcggttcatg 
ttcgaggagg tccgcgaggt cctgcgtgac 
gcgccgggcg cggcccccga ggacaccccg 
cccgagcacc tgcgcgtcta tctgctcggc 
acccggctgc gccgcctggt ctcccgggcc 
ccgcgcgtca cacagatagc cgacgagctg 
ggcgtcgtcg acctgatcca gcacttcgcc 
ctggtcggca tccccgagga ggaccgcccg 
tcgctgcagc cggaccggat gagccggtcc 
ctgatcgcgg cgcggcgccg ggcgctcacc 
catgacgacg acggcagccg gctcagcgac 
gtcctggccg gccacgagac caccgcgcac 
acccaccccg accagctgcg gctgctcaag 



gagcacgtcg gcaagcaccc gggcgagccg 60 

ccgttcgccg gttacggcgc gctgcgcgag 120 

gacgactccc ccgtgtggtt cgtgacccgc 180 

ccgcggttcc ggaacaatcc ggtctccgcg 240 

ctgtcccggc tgatggacat gatgggtttc 300 

tcgatcctca acaacgacgc ccccgaccac 360 

ttcaccgcgc ggaagatcac cgatctgcgg 420 

ctggcccggc tgccggaLgca cgccgaggac 480 

tatcccctgc cgatcaccgt catctgcgaa 540 

cagtggcgca cctggggcgc cgacctggtc 600 

ttcccggcga tgatcgacca catccacgag 660 

gacgatctgc tcagcgagct gatccggacc 720 

gtcgagatgg tcaccatggt cctcaccgtc 780 

ctcatcggca acggcacggc ggccctgctc 840 

gacgacccgg cgctgctgcc gcgcgcggtg 900 
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cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960 

gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020 

tcggcgaacc gcgacccgcg ccactacacc gaccccgacc ggctggacct gacccggcac 1080 

cctgccggcc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140 

gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200 

gagctgtcgc tggccgtcgc gccggaggcc ctggagcgca caccggtacc gggcagctgg 1260 

cggctgaacg cgctgccgct gcgtctgcgc tga 1293 



<210> 8 
<211> 430 
<212> PRT 

<213> Streptoirr/ces lydicus 
<400> 8 

Met Ser Ala Ser Pro Ser Asn Thr Phe Thr Glu His Val Gly Lys His 

15 10 15 

Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu lie Gly Asp Pro Phe 

20 25 30 

Ala Gly Tyr Gly Ala Leu Arg Glu Gin Gly Pro Val Val Arg Gly Arg 

35 40 45 

Phe Met Asp Asp Ser Pro Val Trp Phe Val Thr Arg Phe Glu Glu Val 

50 55 60 

Arg Glu Val Leu Arg Asp Pro Arg Phe Arg Asn Asn Pro Val Ser Ala 
65 70 75 80 

Ala Pro Gly Ala Ala Pro Glu Asp Thr Pro Leu Ser Arg Leu Met Asp 

85 90 95 

Met Met Gly Phe Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser lie 

100 105 110 

Leu Asn Asn Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser 

115 120 125 

Arg Ala Phe Thr Ala Arg Lys He Thr Asp Leu Arg Pro Arg Val Thr 

130 135 140 

Gin He Ala Asp Glu Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp 
145 150 155 160 

Gly Val Val Asp Leu He Gin His Phe Ala Tyr Pro Leu Pro He Thr 

165 170 175 

Val He Cys Glu Leu Val Gly He Pro Glu Glu Asp Arg Pro Gin Trp 

180 185 190 

Arg Thr Trp Gly Ala Asp Leu Val Ser Leu Gin Pro Asp Arg Met Ser 

195 200 205 

Arg Ser Phe Pro Ala Met He Asp His He His Glu Leu He Ala Ala 

210 215 220 

Arg Arg Arg Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu He Arg Thr 
225 230 235 240 

His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Met 

245 250 255 

Val Leu Thr Val Val Leu Ala Gly His Glu Thr Thr Ala His Leu He 

260 265 270 

Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gin Leu Arg Leu 

275 280 285 

Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met 

290 295 300 

Arg Trp Cys Gly Pro Val His Met Thr Gin Leu Arg Tyr Ala Ala Glu 
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305 








Asp 


Val 


Glu 


Le^u AT a 








325 


Leu. 


He 


Leu 


Val Ser 








340 


Asp 


Arcr 


Leu 


Asp Leu 






355 




Val 


Gly 


Phe 


Gly His 




370 






Lys 


Gin 


Glu 


Gly Glu 


385 








Glu 


Leu 


Ser 


Leu Ala 








405 


Pro 


Gly 


Ser 


Trp Arg 



420 



310 315 
Gly Val Arg He Arg Thr 
330 

Ala Asn Arg Asp Pro Arg 
345 

Thr Arg His Pro Ala Gly 
360 

Gly Ala His Tyr Cys Leu 
375 

Val Ala Leu Gly Ala Leu 
390 395 
Val Ala Pro Glu Ala Leu 
410 

Leu Asn Ala Leu Pro Leu 
425 





J/.U 




Val (Tin 




33S 


Hi s Tvr Thr 




350 




His Ala Glu 


Aqn W-i o 


365 






T < ^*» t A "7 
J-JtSU fLLa 


380 




Leu Arg His 


Phe Pro 




400 


Glu Arg Thr 


Pro Val 




415 


Arg Leu Arg 




430 





<210> 9 
<211> 1299 
<212> DNA 

<213> Streptoinyces sp. 
<400> 9 

atgtcagcct tatccagctc tccgttcgcc gagcacatag ggaaacaccc gggcgagccg 60 

aacgtgatgg aaccggctct gatcaacgat ccgttcggcg gctacggcgc gctgcgcgag 120 

oaggggccgg ttgtgcgtgg ccggttcatg gacgactcgc ccgtgtggtt cgtgacccgc 180 

ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggcgtcgccg 240 

ctcctgggca gtcaggtcga ggagatgccg atggtcaagc tgctggagca gatgggcctc 300 

cccgagcacc ttcgggtcta tctgctcgga tcgatcctca acagtgacgc ccccgatcac 360 

acccggcttc gccgcctcgt ctcgcgggcc ttcaccgcac gtaagatcac cggtctgcgg 420 

ccgcgcgtcg agcagatcgc cgacgagctg ctggcccggc tccccgagca cgccgaggac 480 

ggcgtcgtcg acctcatcca gcacttcgcc tacccgctgc cgatcacggt catctgcgaa 540 

ctggtcggca tacccgaagc cgatcgcccg caatggcgcg catggggcgc cgacctcgtg 600 

tcactggagc cggacaagct cagcacgtcg ttcccggcga tgatcgacca cacccatgaa 660 

ctgatccgcc aacggcgcgg cgcgctcacc gacgatctgc tcagcgagcfc gatccgtgcc 720 

catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt gttcgctctc 780 

gtcttcgccg gtcacgagac caccgcccac ctcataggca acggcacggc ggcgctgctc 840 

acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc gcgtgccgtc 900 

catgagctga tgcgctggtg cgggccggtg cacatgaccc agttgcgtta cgcctccgag 960 

gacatcgacc tcgccggtac gccgatccgg aagggcgacg ccgtccaact catcctggta 1020 

tcggcgaact tcgacccccg ccactacagc gaccccgatc gcctcgacct gacccgtcac 1080 

cccgcaggcc acgccgagaa ccacgtgggc ttcggccacg ggatgcacta ctgcttgggc 1140 

gccgcgctcg ccaggcagga aggcgaagtg gcgttcggca aactgctcgc gcactacccg 1200 

gaegtagcgc tgggcgtcga accggaagcc ctggagcggg tgccgatgcc cggcagttgg 1260 

cggctgaatt ccttgccgct gcggttggcg aagcgctaa 1299 



<210> 10 
<211> 432 
<212> PRT 

<213> Streptomyces sp. 
<400> 10 

Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Glu His He Gly Lys His 
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1 








5 




Pro 


Gly 


Glu 


Pro 


Asn Val Met 

xiOIJ. VQX 


Glu 








20 






Gly 


Gly 




Gly 


Al ^ T.on At~ct 


Glu 






35 






40 


Phe 


Met 


Asp 


Asp 


Ser Pro Val 


Trp 




50 






55 




Arg 


Gin 


Val 


Leu 


Aver A^o Gl ti 


Arcr 


65 








70 




Leu 


Leu 


Glv 


Ser 


Gin Val Glu 


Glu 










85 




Gin 


Met 


ffl V 


Leu 


Pro Gl u Hi <=5 

-L _L LJ v_r j_ u. hid 


Leu 








100 






Leu 


Asn 


Ser 


Asp 


Al ^ Pro Acsn 


His 






115 






120 


Aver 


Ala 


P"ho 

JTJ.J.C: 


X 111 


Al a ATYT TiA/C? 


Tie 

JL JLC: 




1 30 






135 




ril -n 

VJH1 


J__Lt^ 


Al Pi 


A C*T> 


f-Tl n T >oi i T ,m i 

VJ J. LI J-jtiLt Ut-U. 


Al a 










JL _> VJ 






VCtJ- 


Val 


A CIT^ 

.ri.ojky 


T .oi i T 1 o fZl n 

LlCU -Li-C Vj-Lli 


J1-L& 










1 fiS 




Val 
V d J_ 


Tl o 


^ y o 


Pin 

vzrJL LI 


T.on \7^1 Cll \r 

JjttiLl V d_L Vjr_L_y 


Tl 








180 

-LOW 






Arg 


i-\_Lci 






AT a A gt^i T oi i 
£\±,CX ribp JUti Ll 


VclJL 






J. -73 






9DO 

U VJ 


TTtk* 
1 1 1 1 




Pno 


XT J_ KJ 


Al 3 Mot" Tl O 
Cl lit: L- JL JL fc2 


rib]-/ 














Arg 


Arg - 




AT a 


T oi i Tn*i ~y~ A en 
licU. 1 ill -tA-O^J 


Asp 


225 








230 




Xl J_ iz> 




A OT^k 




ml ~\t Qqt A "ttt 


T .on n 














V;=n 


PHo 
xl IfcJ 




Leu 


\t^1 pv>p Al a 


ril w 
oiy 








260 








-t-i&ll 


Vjj_L y 


li.lL 


Al ^ Alp T.on 


JJC Li 














Leu 


Lys 


Asp 


Asp 




Leu 




290 






295 








\ _y o 




1T1 LJ VCll LLlO 


Met 


305 








310 




Asp 


lie 


Asp 


Leu 


Ala Glv Tlrr 


Pro 










325 




Leu 


He 


Leu 


Val 


Qpr- Al ^ Asn 


Phe 








340 






Asp 


Arcr 


Leu 


Asp 


I»eu Thr Arcr 


His 






355 






360 


Val 


Gly 


Phe 


Gly 


His Gly Met 


His 




370 






375 




Arg 


Gin 


Glu 


Gly 


Glu Val Ala 


Phe 


385 








390 




Asp 


Val 


Ala 


Leu 


Gly Val Glu 


Pro 










405 




Pro 


Gly 


Ser 


Trp 


Arg Leu Asn 


Ser 



420 





10 


15 




Pro 


Ala 


Leu He Asn Asr> Pro 


Phe 


25 




30 




Gin 


Glv 


Pro Val Val Arcr Glv 








45 




Phe 


Val 


Thr Ara Phe Glu Glu 

X. X 1 1- XXX* N^j X. X XVv \J -1* LA W -1— Wi 


Val 






60 




Phe 


Val 


Asn Asn Pro Ala Ser 


Pro 






75 


80 


Met 


Pro 


Met Val Lvs Leu Leu 

X 1»— V wlX. X— I y M J — * \-X. XJV^UL 


Glu 




90 


95 




Arcj 


Val 


Tvr Leu Leu Glv Ser 

J-JTU- J—JS^U. Jt.JV-.V-l. VjfJLJT U&JL 


lie 


105 




110 




Thr 


Arcr 


T tet i Arcr Arcr T .oi i \7Vg 1 


Ser 






125 




Thr 


Gly 


t.oii Arcr Pro At~ct \Tr^ 1 


Glu 






140 






Leu 


Prn Gin Hi q Ala Gin 

J. 1 L> VJ JL LL liliS £\JL CL vjXLl 


Asp 






155 


i fin 

1UU 


Phe 


Ala 


TVr ~Pvo T.on Prn Tl o> 

Ijl 1 J, U J_ICZLL JL J_ V_/ 11C 


Thr 
iiii 




1 70 






L 1 LJ* 


VjJL U. 


7\1 p A en Avrr Pvn dim 
jtt_Lct zt-L y Jtr _L LJ vjJ_11 


±rp 


1 ftS 

lOJ 




1 Q0 




cpr 


T .oi i 


mlii Pyn A cm Ta/ci T.on 
vjj-U- it jl w jMojJ ±jy j_it;uL 


Oci 










xiio 


11JJL 


TTt c; (Til n T.on Tl o Arrr 

1 l-L O VJT J_ LI i—ifei Li. JLXti i^JLy 


VJJ_11 






990 




T .oi i 


T ,oi i 


Cor flJln T.on Tl o Awr 

OCTJL VJ_LLL UcU. J — Ltr z-ij_y 


Al a 






At ~J ~J 








\/pil nln Moh "\7>»1 Thr 

V CIX VJlU l r lti L. Val 11J-L. 


l v lti L- 




250 

/Ci KJ 






ni to 


VjJL u. 


rp'h-v' r PT-ir~ Al a Wi o T.on 
nil 1 ill .rVld xixt? -i_itrLL 


Tl o 


265 




270 




Thr 


His 


Prn Ac?n C^l t\ T.on Attt 
JlIU jt\o^ vjjlii ucu jtll y 


Leu 






285 




Pro 


-ti-L y 


Ala Val Hi'q Gin T.p»n 

rllCl V CIJL JL 1 JLO vj J_ Li. iJCLl 


Met 






300 




Thr 


Gin 


Tien Arcr Tvr Ala fipr 


Glu 






315 


320 


lie 


Arcr 


Lvs Glv A so Al a Val 


Gin 




330 


335 




Asp 


Pro 


Arcr Hi r Tvr 5ior~ Asd 


Pro 


345 




350 




Pro 


Ala 


Gly His Ala Glu Asn 


His 






365 




Tvr 


Cys 


Leu Gl v Al a Al a Leu 


Ala 






380 




Gly 


Lys 


Leu Leu Ala His Tyr 


Pro 






395 


400 


Glu 


Ala 


Leu Glu Arg Val Pro 


Met 




410 


415 




Leu 


Pro 


Leu Arg Leu Ala Lys 


Arg 


425 




430 
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<210> 11 
<211> 1293 
<212> DNA 

<213> Streptornyces chattanoogenesis 
<400> 11 

atgtcggcat cacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgagccg 60 

aacgtgatgg atccggcgct gatcggtgat ccgttcgccg gttacggcgc gctgcgcgag 120 

cagggcccgg tcgtgcgggg gcggttcatg gacgactccc ccgtgtggtt cgtgacccgc 180 

ttcgaggagg tccgcgaggt cctgcgtgac ccgcggttcc ggaacaatcc ggtctccgcg 240 

gcgccgggcg cggcccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300 

cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360 

acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420 

ccgcgcgtca cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480 

ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540 

ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600 

tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660 

ctgatcgcgg cgcggcgccg ggcgctcacc gacgacctgc tcagcgagct gatccggacc 720 

catgacgacg acggcagcag gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780 

gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840 

acccaccccg accagctgcg gctgctcaag gacgacccgg cactgctgcc gcgcgcggtg 900 

cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960 

gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020 

tcggcgaacc gcgacccgcg ccactacacc gaccccgacc gtctggacct gacccggcac 1080 

cccgccggtc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140 

gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200 

gagctgtcgc tggccgtcgc gccggacgcc ctggagcgca caccggtacc gggcagctgg 1260 

cggctgaacg cgctgccgct gcgtctgggc tga 1293 



<210> 12 

<211> 430 

<212> PRT 

<213> Streptornyces chattanoogenesis 



<400> 12 



Met 


Ser 


Ala 


Ser 


Pro 


Ser 


Asn 


Thr 


Phe 


Thr Glu His Val Gly Lys His 


1 








5 










10 15 


Pro 


Gly 


Glu 


Pro 


Asn 


Val 


Met 


Asp 


Pro 


Ala Leu He Gly Asp Pro Phe 








20 










25 


30 


Ala 


Gly 


Tyr 


Gly 


Ala 


Leu 


Arg 


Glu 


Gin 


Gly Pro Val Val Arg Gly Arg 






35 










40 




45 


Phe 


Met 


Asp 


Asp 


Ser 


Pro 


Val 


Trp 


Phe 


Val Thr Arg Phe Glu Glu Val 




50 










55 






60 


Arg 


Glu 


Val 


Leu 


Arg 


Asp 


Pro 


Arg 


Phe 


Arg Asn Asn Pro Val Ser Ala 


65 










70 








75 80 


Ala 


Pro 


Gly 


Ala 


Ala 


Pro 


Glu 


Asp 


Thr 


Pro Leu Ser Arg Leu Met Asp 










85 










90 95 


Met 


Met 


Gly 


Phe 


Pro 


Glu 


His 


Leu 


Arg 


Val Tyr Leu Leu Gly Ser He 








100 










105 


110 


Leu 


Asn 


Asn 


Asp 


Ala 


Pro 


Asp 


His 


Thr 


Arg Leu Arg Arg Leu Val Ser 






115 










120 




125 


Arg 


Ala 


Phe 


Thr 


Ala 


Arg 


Lys 


He 


Thr 


Asp Leu Arg Pro Arg Val Thr 
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130 






135 140 






Gin 


lie 


Ala 


Asp Glu Leu 


Leu Ala Arg Leu Pro Glu His 


Ala Glu Asp 


145 






150 


155 




160 


Gly 


Val 


Val 


Asp Leu lie 


Gin His Phe Ala Tyr Pro Leu 


Pro He 


Thr 








165 


170 


175 




Val 


lie 


Cys 


Glu Leu Val 


Gly lie Pro Glu Glu Asp Arg 


Pro Gin 


Trp 








180 


185 


190 




Arg 


Thr 


Trp 


Gly Ala Asp 


Leu Val Ser Leu Gin Pro Asp 


Arg Met 


Ser 






195 




200 205 






Arg 


Ser 


Phe 


Pro Ala Met 


lie Asp His lie His Glu Leu 


He Ala Ala 




210 






215 220 






Arg 


Arg 


Arg 


Ala Leu Thr 


Asp Asp Leu Leu Ser Glu Leu 


He Arg Thr 


225 






230 


235 




240 


His 


Asp 


Asp 


Asp Gly Ser 


Arg Leu Ser Asp Val Glu Met 


Val Thr 


Met 








245 


250 


255 




Val 


Leu 


Thr 


Val Val Leu 


Ala Gly His Glu Thr Thr Ala 


His Ti^n 


He 








260 


265 


270 




Gly 


Asn 


Gly 


Thr Ala Ala 


Leu Leu Thr His Pro Asp Gin 


J— ICLL jt\j_ y 


T .oi i 

JJCU 






275 




280 285 






Leu 


iys 


Asp 


Asp Pro Ala 


Leu Leu Pro Arg Ala Val His 


Glu Leu 


Met 




290 






295 300 






Arg 


Trp 


Cys 


Gly Pro Val 


His Met Thr Gin Leu Arg Tyr 


Ala Ala 


Glu 


305 






310 


315 




320 


Asp 


Val 


Glu 


Leu Ala Gly 


Val Arg lie Arg Thr Gly Asp 


Ala Val 


Gin 








325 


330 


335 




Leu 


lie 


Leu 


Val Ser Ala 


Asn Arg Asp Pro Arg His Tyr 


Thr Asp 


Pro 








340 


345 


350 




Asp 


Arg 


Leu 


Asp Leu Thr 


Arg His Pro Ala Gly His Ala 


Glu Asn 


His 






355 




360 365 






Val 


Gly 


Phe 


Gly His Gly 


Ala His Tyr Cys Leu Gly Ala 


Thr Leu 


Ala 




370 






375 380 






Lys 


Gin 


Glu 


Gly Glu Val 


Ala Leu Gly Ala Leu Leu Arg 


His Phe 


Pro 


385 






390 


395 




400 


Glu 


Leu 


Ser 


Leu Ala Val 


Ala Pro Asp Ala Leu Glu Arg 


Thr Pro 


Val 








405 


410 


415 




Pro 


Gly 


Ser 


Trp Arg Leu 


Asn Ala Leu Pro Leu Arg Leu 


Gly 










420 


425 


430 





<210> 13 
<211> 1290 
<212> DNA 

<213> Streptomyces sp. 
<400> 13 

atgaccgaat tagcggactc ccccttcagc gagcacgtcg gcaaacaccc cggcgagccg 60 
aacgtgatgg aaccggccct gctcaccgat ccgttcaccg gctacggcga actgcgcgaa 120 
cagggcccgg tggtccgcgg ccggttcgcg gacgacaccc ccgtgtggtt catcacccgc 180 
ttcgaggagg cccgcgaggt gctgcgcgac caccggttcg ccaatgcccc cgccttcgcg 240 
gcgggaggtg gaagcggtga cacaccctcc aaccggctga tggaaatcat gggcctgccc 300 
gagcactacc gggtgtacct cgccaacacc atcctcacca tggacgcccc cgacca.ca.cc 360 
cggatccggc gattggtctc ccgggcattc accgcccgta agatcaccga tctgcgaccc 420 
cgggtggagg acatcgcgga cgatctgctg aggcggctgc ccgagcacgc cgaggacggc 480 
gtcgtcgacc tcatcaagca ctacgcctat ccgctgccca taacggtcat ctgcgaactg 540 
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gtgggaattc cggaggaaga ccgactgcag tggcgggatt gggggtccgc gttcgtctcc 600 

ctgcaaccgg afccggctcag caaagcgttc ccggcgatga tcgaacacat tcacgcgctg 660 

atccgcgaac ggcgcgcggc gctcaccgac gatctgctca gcgaactgat ccgggtccat 720 

gacgacgacg gcggccgact cagcgacgtc gaaatggtca cgatggtcct gaccctcgtt 780 

ctcgccggtc atgagaccac cgcccatctc atcggcaacg gcactgccgc gcttctcacc 840 

caccccgacc agctgcacct gctgaaatcc gatccggagc tgctcccacg cgccgtgcac 900 

gagctgatgc gctggtgcgg a.ccggtgca.g atgacgcagt tgcggtacgc caccgaggac 960 

gtcgaggtgg ccggggtgca ggtcaagcag ggcgaagcgg tgctggccat gctggtcgcg 1020 

gcgaaccacg acccccgcca cttcgccgac cccgcccggc tcgacctcac ccgccagccg 1080 

9"cgggccggg ccgagaacca cgtcggtttc ggccacggca tgcactactg cctgggcgcc 1140 

agcctggccc gccaggaggg cgaggtcgcc ttcgggaacc tgctcgcgca ctacccggac 1200 

gtgtcgctgg cggtggaacc ggacgccctc cagcgggtcc cgctgccggg caactggcgg 1260 

ctggccgca.c tgccggtccg gctgcgctga. 1290 



<210> 14 

<211> 429 

<212> PRT 

<213> Streptomyces sp. 



<400> 14 



Met 


Thr 


Glu 


Leu 


Ala 


Asp 


Ser 


Pro 


Phe 


Ser 


Glu His 


Val 


Gly Lys His 


1 








5 










10 






15 


Pro 


Gly 


Glu 


Pro 


Asn 


Val 


Met 


Glu 


Pro 


Ala 


Leu Leu 


Thr 


Asp Pro Phe 








20 










25 








30 


Thr 


Gly 


Tyr 


Gly 


Glu 


Leu 


Arg 


Glu 


Gin 


Gly 


Pro Val 


Val 


Arg Gly Arg 






35 










40 








45 




Phe 


Ala 


Asp 


Asp 


Thr 


Pro 


Val 


Trp 


Phe 


He 


Thr Arg 


Phe 


Glu Glu Ala 




50 










55 








60 






Arg 


Glu 


Val 


Leu 


Arg 


Asp 


His 


Arg 


Phe 


Ala 


Asn Ala 


Pro 


Ala Phe Ala 


65 










70 










75 




80 


Ala 


Gly 


Gly 


Gly 


Ser 


Gly 


Asp 


Thr 


Pro 


Ser 


Asn Arg 


Leu 


Met Glu He 










85 










90 






95 


Met 


Gly 


Leu 


Pro 


Glu 


His 


Tyr 


Arg 


Val 


Tyr 


Leu Ala 


Asn 


Thr He Leu 








100 










105 








110 


Thr 


Met 


Asp 


Ala 


Pro 


Asp 


His 


Thr 


Arg 


He 


Arg Arg 


Leu 


Val Ser Arg 






115 










120 








125 




Ala 


Phe 


Thr 


Ala 


Arg 


Lys 


He 


Thr 


Asp 


Leu 


Arg Pro 


Arg 


Val Glu Asp 




130 










135 








140 






He 


Ala 


Asp 


Asp 


Leu 


Leu 


Arg 


Arg 


Leu 


Pro 


Glu His 


Ala 


Glu Asp Gly 


145 










150 










155 




160 


Val 


Val 


Asp 


Leu 


lie 


Lys 


His 


Tyr 


Ala 


Tyr 


Pro Leu 


Pro 


He Thr Val 










165 










170 






175 


He 


Cys 


Glu 


Leu 


Val 


Gly 


He 


Pro 


Glu 


Glu 


Asp Arg 


Leu 


Gin Trp Arg 








180 










185 








190 


Asp 


Trp 


Gly 


Ser 


Ala 


Phe 


Val 


Ser 


Leu 


Gin 


Pro Asp 


Arg 


Leu Ser Lys 






195 










200 








205 




Ala 


Phe 


Pro 


Ala 


Met 


He 


Glu 


His 


He 


His 


Ala Leu 


He 


Arg Glu Arg 




210 










215 








220 






Arg 


Ala 


Ala 


Leu 


Thr 


Asp 


Asp 


Leu 


Leu 


Ser 


Glu Leu 


He 


Arg Val His 


225 










230 










235 




240 


Asp 


Asp 


Asp 


Gly 


Gly 


Arg 


Leu 


Ser 


Asp 


Val 


Glu Met 


Val 


Thr Met Val 










245 










250 






255 


Leu 


Thr 


Leu 


Val 


Leu 


Ala 


Gly 


His 


Glu 


Thr 


Thr Ala 


His 


Leu He Gly 
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260 



Asn 


Gly 


Thr 


Ala 


Ala 


Leu 


Leu 


Thr 






275 










280 


Lys 


Ser 


Asp 


Pro 


Glu 


Leu 


Leu 


Pro 




290 










295 




Trp 


Cys 


Gly 


Pro 


Val 


Gin 


Met 


Thr 


305 










310 






Val 


Glu 


Val 


Ala 


Gly 


Val 


Gin 


Val 










325 








Met 


Leu 


Val 


Ala 


Ala 


Asn 


His 


Asp 








340 










Arg 


Leu 


Asp 


Leu 


Thr 


Arg 


Gin 


Pro 






355 










360 


Gly 


Phe 


Gly 


His 


Gly 


Met 


His 


Tyr 




370 










375 




Gin 


Glu 


Gly 


Glu 


Val 


Ala 


Phe 


Gly 


385 










390 






Val 


Ser 


Leu 


Ala 


Val 


Glu 


Pro 


Asp 










405 








Gly 


Asn 


Trp 


Arg 


Leu 


Ala 


Ala 


Leu 



420 



265 








270 






His 


Pro 


Asp Gin 


Leu 


His 


Leu 


Leu 








285 








Arg 


Ala 


Val His 


Glu 


Leu 


Met 


Arcr 






300 










Gin 


Leu 


Arg Tyr 


Ala 


Thr 


Glu 


Asp 






315 








320 


Lys 


Gin 


Gly Glu 


Ala 


Val 


Leu 


Ala 




330 








335 




Pro 


Arg 


His Phe 


Ala 


Asp 


Pro 


Ala 


345 








350 






Ala 


Gly 


Arg Ala 


Glu 


Asn 


His 


Val 








365 








Cys 


Leu 


Gly Ala 


Ser 


Leu 


Ala 


Arg 






380 










Asn 


Leu 


Leu Ala 


His 


Tyr 


Pro 


Asp 






395 








400 


Ala 


Leu 


Gin Arg 


Val 


Pro 


Leu 


Pro 




410 








415 




Pro 


Val 


Arg Leu 


Arg 









425 



<210> 15 
<211> 1428 
<212> DNA 

<213> Streptomyces albofaciens 
<400> 15 

atgaccacat cgcccaccga gtcccgggcg gccaccccgc ccgactccac cgcctccccc 60 

tcgaccgctg ccgccccggc caccacccct tcggccgccg cctctccgga caccacctct 120 

cccgccacca ccgaccgcac cacgctcccc tcctacgtcg gcctccaccc gggcgagccg 180 

aacctgatgg aaccggagct gctggacaac ccgtacaccg gctacggcac gctgcgcgag 240 

caggcgccgc tcgtccgcgc ccggttcatc gacgactcgc ccatctggct ggtgacccgc 300 

ttcgacgtgg tgcgcgaggt gatgcgcgac cagcggttcg tcaacaaccc gaccctggtg 360 

cccggcatcg gtgcggacca ggacccgcgc gcccggctga tcgagctgtt cggcatcccc 420 

gaggacctgg ccccgtacct caccgacacc atcctcacca gcgacccgcc ggaccacacc 480 

cggctgcgcc gcctggtctc ccgtgccttc accgcacgcc gtatccagga cctgcggccg 540 

cgcgtcgagc ggatcaccga cgagctgctg gcgcggctgc cggaccatgc cgaggacggc 600 

gtcgtcgacc tcgtcgagca cttcgcctac ccgctgccca tcacggtcat ctgcgaactg 660 

gtcggcatcg acgaggagga ccgggcgctg tggcggcggt tcggcgccga cctcgcctcg 720 

ctgaacccca agcgcatcgg cgccaccatg ccggagatga tcgcgcacat ccacgaggtg 780 

atcgacgagc ggcgtgcgga cctgcgggac gacctgctca gcgggctcat ccgggcgcag 840 

gacgacgacg gcggccggct gagcgacgtc gagatggtca cgctggtgct gaccctggtg 900 

ctggccggtc a.cgagciccac cgcccacctc atcagcaacg gceiccctcgc cctgctcacc 960 

caccccgacc agcggcggct gatcgacgag gacccggcgc tgctgccgcg cgcggtccac 1020 

gagctgatgc gctggtgcgg gccgatccag gccacccagc tgcggtacgc catggaggac 1080 

accgaggtgg ccggtgtcca ggtccgccag ggcgaggccc tgatgttcag cctcgtcgcg 1140 

gccaaccacg acccgcgcca ctacaccggc ccggagcggc tcgacctgac gcggcagccg 1200 

gccggccgcg ccgaggacca cgtcggcttc gggcacggga tgcactactg cctgggtgcc 1260 

tcactggccc ggcaggaggc cgaggtggcg tacggcaagc tgctcacccg ctacccggac 1320 

ctggcgctcg cgctcacccc ggaacagctg gaggaccagg aacgcctgcg gcagcccggc 1380 

acctggcgcc tgcgacggct gccgctgagg ttgcacgcgg agagctga 1428 
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<210> 16 

<211> 475 

<212> PRT 

<213> Streptonyces albofaciens 



<400> 16 



Met 


Thr 


Thr 


Ser 


Pro 


Thr 


Glu 


Ser 


Arcr Ala Ala Thr Pro Pro Asr> 9er 


1 








5 








10 15 


Thr 


Ala 


Ser 


Pro 


Ser 


Thr 


Ala 


Ala 


7\1 3 Prn Ala Thr* TVrr Prn fipr Ala 








20 










25 "30 


Ala 


Ala 


Ser 


Pro 


Asp 


Thr 


Thr 


Ser 


Prn Al a Thr* Thr A^n Arrr Thr" Thr- 

Jt »u nxci iiJ-L xiizajw' niy J-in. liii 






~ > o 










40 




Leu 


Pro 


Ser 




Val 


cn v 


Leu 


His 


Prn Cil "V Cil n Prn J\c>n T,pii Mot - nln 
-L j_ l> vjjjujy vjj_u ri.u riDii xj^li nt. oXU. 




SO 

-J L/ 










— ■> 






-L »1_ 


cn ii 


J— ix^Ut 


T,<=*7 1 






Prn 
jr j_ w 




Trir* Cil at - THrr CL~\-\r Tttk- T.<=»n Arrr fTllTi 
-i x j_l vjj j-_y J- _y x vjj_Ljy i iix xjc; Li r^x y vjx Li 


UJ 










7D 






/ -J OU 


Vex XII 


Al a 

Hid. 


IT J- LJ 


Leu 


Val 


A wt 


AT a 


ni y 


irilti lie /Atoki iibp o€3X rIO X1S lip 


















J7L/ ^3 


J_>eU 


Val 










VciX 


Val 
VdX 


ru. y ljxu. vax luec f\ry /isp vjjXH Hiy 








1 on 

1UU 










xuo xxu 


P"h<=» 

JT.L±t£ 


vela. 


A cm 


-rt.»3JX 


i7X W 


X11X 


T iCil 7 




Prn m-vr Tl r2l\r Ala Z\qd dl t~i Acn 
1TX LJ uxy J — Lt: LrrX_y Ala iibp vuXIl fibp 






j-ij 










ion 
X*£ u 


XZ D 


JrX O 


Arg 


Hla 


Arg 


Leu 


xxe 


r*i n 


Leu 


±rne biy xxe irxo biu Asp Jbeu Aia 


















i /in 


XTX LJ 


±yr 


T .e^i i 
XifcrLL 


T'Vir 




J.1JX 


J — Lt: 


XjfcJU. 


XIXL Ofc=X -r\oJL> IrXO JrxO I\t? JL> XlXS J.XXT 


1 

X*±3 
















Xjj XDU 


ru- y 




y 


r\s~ y 


i— itr u. 


Val 

V CIJ_ 


Q^r~ 


ni y 


Al a P"hf^ Thr* Al a Arvr Arrr Tl o C4l n 










X vJ_> 








1 7fl 1 7R 
X / W X / 3 




Xifcii Ll 


A rrY 


JT-L Ly» 




Val 

V CtX 


Cil 7 7 


i-jxy 


lie XilX irifcjLJ LrlU XitJLi XitiU. jriXd /iXy 








-LOW 










lOJ X _> u 




Prn 




A 1_L & 


Ala 


cn n 

VJJ_ LI 




rilv 

W-Ljf 


Val Val Ac:n T.^n "\ral C4l n PM c? P"h*= 

V d_L VC1_L JJCU VClX VJJLU LlXto XlXcS 






195 










200 


90S 

\J 


Ala 


Tyr 


Pro 


Leu 


Pro 


He 


Thr 


Val 


Tie fVq Cilu l\e*n Val Hlv Tl ^ A^rn 




210 










215 




220 


Glu 


Glu 


Asp 


Arg 


Ala 


Leu 


Trn 


Arg 


Arcr Phe Glv Ala Aso Tieu Ala Ser 


225 










230 






235 240 


Leu 


Asn 


Pro 


Lys 


Arg 


He 


Glv 


Ala 


Thr Met Pro Glu Met He Ala His 










245 








250 255 


He 


His 


Glu 


Val 


He 


Asp 


Glu 


Arg 


Arg Ala Asp Leu Arg Asp Asp Leu 








260 










265 270 


Leu 


Ser 


Gly 


Leu 


He 


Arg 


Ala 


Gin 


Asp Asp Asp Gly Gly Arg Leu Ser 






275 










280 


285 


Asp 


Val 


Glu 


Met 


Val 


Thr 


Leu 


Val 


Leu Thr Leu Val Leu Ala Gly His 




290 










295 




300 


Glu 


Thr 


Thr 


Ala 


His 


Leu 


He 


Ser 


Asn Gly Thr Leu Ala Leu Leu Thr 


305 










310 






315 320 


His 


Pro 


Asp 


Gin 


Arg 


Arg 


Leu 


He 


Asp Glu Asp Pro Ala Leu Leu Pro 










325 








330 335 


Arg 


Ala 


Val 


His 


Glu 


Leu 


Met 


Arg 


Trp Cys Gly Pro He Gin Ala Thr 








340 










345 350 


Gin 


Leu 


Arg 


Tyr 


Ala 


Met 


Glu 


As P 


Thr Glu Val Ala Gly Val Gin Val 






355 










360 


365 


Arg 


Gin 


Gly 


Glu 


Ala 


Leu 


Met 


Phe 


Ser Leu Val Ala Ala Asn His Asp 
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370 




375 


Pro Arg 


His Tyr 


Thr Gly Pro Glu 


385 




390 


Ala Gly 


Arg Ala 


Glu Asp His Val 






405 


Cys Leu 


Gly Ala 


Ser Leu Ala Arg 




420 




Lys Leu 


Leu Thr 


Arg Tyr Pro Asp 




435 


440 


Gin Leu 


Glu Asp 


Gin Glu Arg Leu 


450 




455 


Arg Arg 


Leu Pro 


Leu Arg Leu His 


465 




470 



380 

Arg Leu Asp Leu Thr Arg Gin Pro 
395 400 
Gly Phe Gly His Gly Met His Tyr 

410 415 
Gin Glu Ala Glu Val Ala Tyr Gly 
425 430 
Leu Ala Leu Ala Leu Thr Pro Glu 
445 

Arg Gin Pro Gly Thr Trp Arg Leu 
460 

Ala Glu Ser 
475 



<210> 17 
<211> 1293 
<212> DNA 

<213> Streptoinyces platensis 
<400> 17 

atgtcggcat tacccacctc accgttcgct gcacacgtcg ggaaacaccc gggcgagccg 60 

aatgtgatgg acccggcact gatcaccgac ccgttcaccg gctacggcgc gctgcgcgag 120 

cagggcccgg tcgtccgcgg ccgcttcgtg gacgactcac ccgtctggct ggtgacgcga 180 

ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaaccc ggcggcgccc 240 

tccctgggcc acgcggccga ggacaacccg ctcaccaggc tgatggacat gctgggcctc 300 

cccgagcacc tccgccccta cctcctcgga tcgattctca attacgacgc ccccgaccac 360 

acccggctgc gccgcctggt gtcgcgggcc ttcaccgccc gcaagatcac cgacctgcgg 420 

ccgcgggtcg agcagatcgc cgacgccctg ctggcccggc tgcccgagca cgccgaggac 480 

ggcgtcgtcg atctcatccg gcacttcgcc tacccgctgc cgatcaccgt catctgcgaa 540 

ctggtcggca tacccgaagc ggaccgcccg cagtggcgga cgtggggcgc cgacctcgtc 600 

tcgatggagc cggaccggct caccgcctcg ttcccgccga tgatcgagca catccaccgg 660 

atggtccggg agcggcgcgg cgcgctcacc ggcgatctgc tcagcgagct gatccgtgcc 720 

catgacgacg acggcggccg gctcagcgac gtcgagatgg tcaccttgat cctcacgctc 780 

gtcctcgccg gtcacgagac caccgctcac ctcatcagca acggcacggc ggcgctgctc 840 

acccciccccg a.cca.a.ctgcg cctgctccag gacgacccgg ccctgctccc ccgtgccgtc 900 

cacgagctga tgcgctggtg cgggccggtg cagatgaccc agctgcgtta cgccgccgcc 960 

gacgtcgacc tggccggcac cacgatccac cggggcgacg ccgtccaact catcctggtg 1020 

tcggcgaact tcgacccccg ccactacacc gaccccgacc gcctcgatct gacccgccac 1080 

cccgcgggac atgcggagaa ccatgtgggt ttcggccatg gggcgcacta ctgcctgggc 1140 

gccacactcg ccaagcagga gggcgaagtc gccttcggca aactgctcgc gcactacccg 1200 

gagatggcgt tgggcgtcgc accggagcgc ctggagcgga cgcccctgcc gggcaactgg 1260 

cggctgaacg cgctgccggt gcggttgggg tga 1293 



<210> 18 
<211> 430 
<212> PRT 

<213> Streptomyces platensis 
<400> 18 

Met Ser Ala Leu Pro Thr Ser Pro Phe Ala Ala His Val Gly Lys His 
15 10 15 
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Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu lie Thx Asp Pro Phe 

20 25 30 

Thr Gly Tyr Gly Ala Leu Arg Glu Gin Gly Pro Val Val Arg Gly Arg 

35 40 45 

Phe Val Asp Asp Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val 

50 55 60 

Arg Gin Val Leu Arg Asp Gin Arg Phe Val Asn Asn Pro Ala Ala Pro 
65 70 75 80 

Ser Leu Gly His Ala Ala Glu Asp Asn Pro Leu Thr Arg Leu Met Asp 

85 90 95 

Met Leu Gly Leu Pro Glu His Leu Arg Pro Tyr Leu Leu Gly Ser He 

100 105 110 

Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser 

115 120 125 

Arg Ala Phe Thr Ala Arg Lys He Thr Asp Leu Arg Pro Arg Val Glu 

130 135 140 

Gin He Ala Asp Ala Leu Leu Ala Arg Leu Pro Glu His Ala Glu Asp 
145 150 155 160 

Gly Val Val Asp Leu He Arg His Phe Ala Tyr Pro Leu Pro He Thr 

165 170 175 

Val He Cys Glu Leu Val Gly He Pro Glu Ala Asp Arg Pro Gin Trp 

180 185 190 

Arg Thr Trp Gly Ala Asp Leu Val Ser Met Glu Pro Asp Arg Leu Thr 

195 200 205 

Ala Ser Phe Pro Pro Met He Glu His He His Arg Met Val Arg Glu 

210 215 220 

Arg Arg Gly Ala Leu Thr Gly Asp Leu Leu Ser Glu Leu He Arg Ala 
225 230 235 240 

His Asp Asp Asp Gly Gly Arg Leu Ser Asp Val Glu Met Val Thr Leu 

245 250 255 

He Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu He 

260 265 270 

Ser Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gin Leu Arg Leu 

275 280 285 

Leu Gin Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met 

290 295 300 

Arg Trp Cys Gly Pro Val Gin Met Thr Gin Leu Arg Tyr Ala Ala Ala 
305 310 315 320 

Asp Val Asp Leu Ala Gly Thr Thr He His Arg Gly Asp Ala Val Gin 

325 330 335 

Leu He Leu Val Ser Ala Asn Phe Asp Pro Arg His Tyr Thr Asp Pro 

340 345 350 

Asp Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His 

355 360 365 

Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Thr Leu Ala 

370 375 380 

Lys Gin Glu Gly Glu Val Ala Phe Gly Lys Leu Leu Ala His Tyr Pro 
385 390 395 400 

Glu Met Ala Leu Gly Val Ala Pro Glu Arg Leu Glu Arg Thr Pro Leu 

405 410 415 

Pro Gly Asn Trp Arg Leu Asn Ala Leu Pro Val Arg Leu Gly 
420 425 430 
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<210> 19 
<211> 1293 
<212> DNA 

<213> Streptorryces kasugaensis 
<400> 19 

atgtcggcat cacccagcaa cacgttcacc gagcacgtcg gcaagcaccc gggcgagccg 60 

aacgtgatgg atccggcgct gatcggggat ccgttcgccg gttacggcgc gctgcgcgag 120 

cagggcccgg tcgtgcgggg gcggttcatg gacgactccc ccgtgtggtt cgtgacccgc 180 

ttcgaggagg tccgcgaggt cctgcgtgac ccgcggttcc ggaacaatcc ggtctccgcg 240 

gcgccgggcg cggcccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300 

cccgagcacc tgcgcgtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360 

acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac cgatctgcgg 420 

ccgcgcgtca cacagatagc cgacgagctg ctggcccggc tgccggagca cgccgaggac 480 

ggcgtcgtcg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgcgaa 540 

ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600 

tcgctgcagc cggaccggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660 

ctgatcgcgg cgcggcgccg ggcgctcacc gacgatctgc tcagcgagct gatccggacc 720 

catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780 

gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840 

acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgcggtg 900 

cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960 

gacgtcgagc tggcgggcgt ccggatccgc acgggggacg ccgtccagct catcctggtg 1020 

tcggcgaacc gcgacccgcg ccactacacc gaccccgacc ggctggacct gacccggcac 1080 

cctgccggcc acgcggagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140 

gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200 

gagctgtcgc tggccgtcgc gccggaggcc ctggagcgca caccggtacc gggcagctgg 1260 

cggctgaacg cgctgccgct gcgtctgcgc tga 1293 



<210> 20 

<211> 430 

<212> PRT 

<213> Streptoirryces kasugaensis 



<400> 20 



Met 


Ser 


Ala 


Ser 


Pro 


Ser 


Asn 


Thr 


Phe 


Thr Glu His Val Gly Lys His 


1 








5 










10 15 


Pro 


Gly 


Glu 


Pro 


Asn 


Val 


Met 


Asp 


Pro 


Ala Leu He Gly Asp Pro Phe 








20 










25 


30 


Ala 


Gly 


Tyr 


Gly 


Ala 


Leu 


Arg 


Glu 


Gin 


Gly Pro Val Val Arg Gly Arg 






35 










40 




45 


Phe 


Met 


Asp 


Asp 


Ser 


Pro 


Val 


Trp 


Phe 


Val Thr Arg Phe Glu Glu Val 




50 










55 






60 


Arg 


Glu 


Val 


Leu 


Arg 


Asp 


Pro 


Arg 


Phe 


Arg Asn Asn Pro Val Ser Ala 


65 










70 








75 80 


Ala 


Pro 


Gly 


Ala 


Ala 


Pro 


Glu 


Asp 


Thr 


Pro Leu Ser Arg Leu Met Asp 










85 










90 95 


Met 


Met 


Gly 


Phe 


Pro 


Glu 


His 


Leu 


Arg 


Val Tyr Leu Leu Gly Ser He 








100 










105 


110 


Leu 


Asn 


Asn 


Asp 


Ala 


Pro 


Asp 


His 


Thr 


Arg Leu Arg Arg Leu Val Ser 






115 










120 




125 


Arg 


Ala 


Phe 


Thr 


Ala 


Arg 


Lys 


lie 


Thr 


Asp Leu Arg Pro Arg Val Thr 




130 










135 






140 
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Gin 


lie 


Ala 


Asp 


Glu 


Leu 


Leu 


Ala 


Arg 


Leu 


Pro 


Glu 


His 


Ala 


Glu 


Asp 


145 










150 










155 










160 


Gly 


Val 


Val 


Asp 


Leu 
165 


He 


Gin 


His 


Phe 


Ala 
170 


Tyr 


Pro 


Leu 


Pro 


He 
175 


Thr 


Val 


He 


Cys 


Glu 
180 


Leu 


Val 


Gly 


He 


Pro 
185 


Glu 


Glu 


Asp 


Arg 


Pro 
190 


Gin 


Trp 


Arg 


Thr 


Trp 
195 


Gly 


Ala 


Asp 


Leu 


Val 
200 


Ser 


Leu 


Gin 


Pro 


Asp 
205 


Arcr 


Met 


Ser 


Arg 


Ser 
210 


Phe 


Pro 


Ala 


Met 


He 
215 


Asp 


His 


He 


His 


Glu 
220 


Leu 


He 


Ala 


Ala 


Arg 


Arg 


Arg 


Ala 


Leu 


Thr 


Asp 


Asp 


Leu 


Leu 


Ser 


Glu 


Leu 


He 


Arg 


Thr 


225 










230 










235 










240 


His 


Asp 


Asp 


Asp 


Gly 
245 


Ser 


Arg 


Leu 


Ser 


Asp 
250 


Val 


Glu 


Met 


Val 


Thr 
255 


Met 


Val 


Leu 


Thr 


Val 
260 


Val 


Leu 


Ala 


Gly 


His 
265 


Glu 


Thr 


Thr 


Ala 


His 
270 


Leu 


He 


Gly 


Asia 


Gly 
275 


Thr 


Ala 


Ala 


Leu 


Leu 
280 


Thr 


His 


Pro 


Asp 


Gin 
285 


Leu 


Arcr 


Leu 


Leu 


Lvs 
290 


Asp 


Asp 


Pro 


Ala 


Leu 
295 


Leu 


Pro 


Arcr 


Ala 


Val 
300 


His 


Glu 


Leu 


Met 


Arg 


Tro 


Cys 


Glv 


Pro 


Val 


His 


Met 


Thr 


Gin 


Leu 


Arg 


Tvr 


Ala 


Ala 


Glu 


305 










310 










315 










320 


Asp 


Val 


Glu 


Leu 


Ala 
325 


Glv 


Val 


Arcr 


He 


Arg 
330 


Thr 


Glv 


Asp 


Ala 


Val 
335 


Gin 


Leu 


He 


Leu 


Val 
340 


Ser 


Ala 


Asn 


Arg 


Asp 
345 


Pro 


Arg 


His 


Tvr 


Thr 
350 


Asp 


Pro 


Asp 


Arg 


Leu 
355 


Asp 


Leu 


Thr 


Arg 


His 
360 


Pro 


Ala 


Glv 


His 


Ala 
365 


Glu 


Asn 


His 


Val 


Glv 
370 


Phe 


Glv 


His 


Glv 


Ala 
375 


His 


Tvr 


Cys 


Leu 


Glv 
380 


Ala 


Thr 


Leu 


Ala 


Lys 


Gin 


Glu 


Gly 


Glu 


Val 


Ala 


Leu 


Gly 


Ala 


Leu 


Leu 


Arg 


His 


Phe 


Pro 


385 










390 










395 










400 


Glu 


Leu 


Ser 


Leu 


Ala 
405 


Val 


Ala 


Pro 


Glu 


Ala 
410 


Leu 


Glu 


Arg 


Thr 


Pro 
415 


Val 


Pro 


Gly 


Ser 


Trp 
420 


Arg 


Leu 


Asn 


Ala 


Leu 
425 


Pro 


Leu 


Arg 


Leu 


Arg 
430 







<210> 21 
<211> 1428 
<212> DNA 

<213> Streptonryces rimosus 
<400> 21 

atgaccacat cgcccaccga gtcccgggcg gccaccccga ccggctccac cgcctccccc 60 

tcgaccgctt ccgccccggc caccacccct tcggccgcca cctcttcgga caccacctat 120 

cccgccacca ccgaccgcac cacgctcccc tcctacgtcg gcctccaccc gggcgagccg 180 

aacctgatgg aaccggagct gctggacaac ccgtacaccg gctacggcac gctgcgcgag 240 

caggccccgc tcgtccgtgc ccggttcatc gacgactcgc ccatctggct ggtgacccgc 300 

ttcgacgtgg tgcgcgaggt gatgcgcgac cagcggttcg tcaacaaccc gaccctggtg 360 

cccggcatcg gtgcggacaa ggacccgcgc gcccggctga tcgagctgtt cggcatcccc 420 

gaggacctga ccccgtacct cgccgacacc atcctcacca gcgacccgcc ggaccacacc 480 

cggctgcgcc gcctggtctc ccgtgccttc accgcgcgcc gcatccagga cctgcggccg 540 

cgcgtcgagc agatcaccga cgcgctgctg gagcgactgc cgga.cca.tga cgaggacggc 600 
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gtcgtcgacc tcgtcgagca cttcgcctac ccgctgccca tcacggtcat ctgcgagctg 660 

gtcggcatcg acgaggagga ccggacgctg tggcggcggt tcggcgccga cctcgcctca 720 

ctgaacccca agcgcatcgg cgccaccatg ccggagatga tcgcgcacat ccacgaggtg 780 

atcgacgagc ggcgcgcggc cctgcgggac gacctgctca gcgggctcat ccgggcgcag 840 

gacgacgacg gcggccggct gagcgacgtc gagatggtca ccctggtcct gaccctggtg 900 

ctggccggtc acgagaccac cgcccacctc atcagcaacg gcaccctcgc cctgctcacc 960 

caccccgacc agcggcggct gatcgacgag gaccaggca.c tgctgccgcg cgcggtccac 1020 

gagctgatgc gctggtgcgg gccgatccag gccacccagc tgcggtacgc catggaggac 1080 

accgaggtcg ccggtgtcca ggtccgccag ggcgaggccc tgatgttcag cctcgtcgcg 1140 

gccaaccacg acccgcgcca ctacaccggg ccggagcggc tcgacctgac gcggcagccg 1200 

gccggccgcg ccgaggacca cgtcggcttc gggcacggga tgcactactg cctgggtgcc 1260 

tcactcgccc ggcaggaggc cgaggtggcc tacgggaagc tgctcacccg ctacccggac 1320 

ctggagctcg ctctcacacc ggaacagctg gaggaccagg aacgcctgcg gcagcccggc 1380 

acctggcgcc tgcggcggct gccgctgaag ctgcacgcgc ggagctga 1428 



<210> 22 

<211> 475 

<212> PRT 

<213> Streptonryces rimosus 



<400> 22 



Met 


Thr 


Thr 


Ser 


Pro 


Thr 


Glu 


Ser 


Arcr Ala Ala Thr Pro 


Thr Glv Ser 


1 








5 








10 


15 


Thr 


Ala 


Ser 


Pro 


Ser 


Thr 


Ala 


Ser 


Ala Pro Ala Thr Thr 


Pro Ser Al a 








20 










25 


30 


Ala 


Thr 


Ser 


Ser 


Asp 


Thr 


Thr 


Tyr 


Pro Ala Thr Thr Asp 


Arg Thr Thr 






35 










40 


45 




Leu 


Pro 


Ser 


Tyr 


Val 


Gly 


Leu 


His 


Pro Gly Glu Pro Asn 


Leu Met Glu 




50 










55 




60 




Pro 


Glu 


Leu 


Leu 


Asp 


Asn 


Pro 


Tyr 


Thr Gly Tyr Gly Thr 


Leu Arg Glu 


65 










70 






75 


80 


Gin 


Ala 


Pro 


Leu 


Val 


Arg 


Ala 


Arg 


Phe He Asp Asp Ser 


Pro He Trp 










85 








90 


95 


Leu 


Val 


Thr 


Arg 


Phe 


Asp 


Val 


Val 


Arg Glu Val Met Arg 


Asp Gin Arg 








100 










105 


110 


Phe 


Val 


Asn 


Asn 


Pro 


Thr 


Leu 


Val 


Pro Gly He Gly Ala 


Asp Lys Asp 






115 










120 


125 




Pro 


Arg 


Ala 


Arg 


Leu 


He 


Glu 


Leu 


Phe Gly He Pro Glu 


Asp Leu Thr 




130 










135 




140 




Pro 


Tyr 


Leu 


Ala 


Asp 


Thr 


lie 


Leu 


Thr Ser Asp Pro Pro 


Asp His Thr 


145 










150 






155 


160 


Arg 


Leu 


Arg 


Arg 


Leu 


Val 


Ser 


Arg 


Ala Phe Thr Ala Arg 


Arg He Gin 










165 








170 


175 


Asp 


Leu 


Arg 


Pro 


Arg 


Val 


Glu 


Gin 


He Thr Asp Ala Leu 


Leu Glu Arg 








180 










185 


190 


Leu 


Pro 


Asp 


His 


Ala 


Glu 


Asp 


Gly 


Val Val Asp Leu Val 


Glu His Phe 






195 










200 


205 




Ala 


Tyr 


Pro 


Leu 


Pro 


He 


Thr 


Val 


He Cys Glu Leu Val 


Gly He Asp 




210 










215 




220 




Glu 


Glu 


Asp 


Arg 


Thr 


Leu 


Trp 


Arg 


Arg Phe Gly Ala Asp 


Leu Ala Ser 


225 










230 






235 


240 


Leu 


Asn 


Pro 


Lys 


Arg 


He 


Gly 


Ala 


Thr Met Pro Glu Met 


He Ala His 










245 








250 


255 
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He 


His 


Glu Val 


He Asp 


Glu 


Arg 






260 








Leu 


Ser 


Gly Leu 


He Arg 


Ala 


Gin 






275 






280 


Asp 


Val 


Glu Met 


Val Thr 


Leu 


Val 




290 






295 




Glu 


Thr 


Thr Ala 


His Leu 


lie 


Ser 


305 






310 






His 


Pro 


Asp Gin 


Arg Arg 


Leu 


He 








325 






Arg 


Ala 


Val His 


Glu Leu 


Met 


Arg 






340 








Gin 


Leu 


Arg Tyr 


Ala Met 


Glu 


Asp 






355 






360 


Arcr 


Gin 


Gly Glu 


Ala Leu 


Met 


Phe 




370 






375 




Pro 


Arg 


His Tyr 


Thr Gly 


Pro 


Glu 


385 






390 






Ala 


Gly 


Arcr Ala 


nil ij Asn 


His 


Val 








405 






v— _y o 


JjcU 




Ocl J_rt^ Li 


Al a 


7\ ~y~/-~t 






420 








Lys 


Leu 


Leu Thr 


Arg Tyr 


Pro 


Asp 






435 






440 


Gin 


Leu 


Glu Asp 


Gin Glu 


Arg 


Leu 




450 






455 




Arg 


Arg 


Leu Pro 


Leu Lys 


Leu 


His 


465 






470 







Arg 


Ala 


Ala 


Leu 


Arcr 


Asp 


Asp 


Leu 


265 










270 






Asp 


Asp 


Asp 


Glv 


Gly 


Arg 


Leu 


Ser 










285 








Leu 


Thr 


Leu 


Val 


Leu 


Ala 


Glv 


His 








300 










Asn 


Gly 


Thr 


Leu 


Ala 


Leu 


Leu 


Thr 






315 










320 


Asp 


Glu 


Asp 


Pro 


Ala 


Leu 


Leu 


Pro 




330 










335 




Tro 


Cys 


Gly 


Pro 


He 


Gin 


Ala 


Thr 


345 










350 






Thr 


Glu 


Val 


Ala 


Gly 


Val 


Gin 


Val 










365 








Ser 


Leu 


Val 


Ala 


Ala 


Asn 


His 


Asp 


















Arg 


Leu 


Asp 


Leu 


Thr 


Arcr 


Gin 


Pro 






395 










400 


Gly 


Phe 


Gly 


His 


Gly 


Met 


His 


Tyr 




410 










415 




Gin 


Glu 


Ala 


Glu 


Val 


Ala 


Tyr 


Gly 


425 










430 






Leu 


Glu 


Leu 


Ala 


Leu 


Thr 


Pro 


Glu 










445 








Arg 


Gin 


Pro 


Gly 


Thr 


Trp 


Arg 


Leu 



460 



Ala Arg Ser 
475 



<210> 23 
<211> 1293 
<212> DNA 

<213> Streptomyces tubercidicus 
<400> 23 

atgtcggcat tatccaactc cccgctcgcc gcacatgtcg ggaaacaccc tggcgagccg 60 

aatgtgatgg acccggcgct gatcaccgac ccgttcggcg gctacggcgc actgcgcgag 120 

caaggcccgg tcgtacgggg ccggttcatg gacgactcgc ccgtctggct ggtgacgcgc 180 

ttcgaagagg tccgccaagt cctgcgcgat cagcggttcg tgaacaaccc ggccgcaccg 240 

tccctgggac gctcgatcga cgaaagcccc gcggtcagac ttttggaaat gttggggttg 300 

cccgaccatt tccggccgta tctgctcggg tcgatcctca actacgacgc acccgaccac 360 

acccggctcc gccgactggt ctcgcgcgcc ttcacggcac gcaagatcac cgacctgcgg 420 

ccgcgggtcg agcagatcac cgacgacctg ctgacccggc ttcccgagca cgccgaggac 480 

ggtgtggtcg acctcatcca gcacttcgcc taccccctgc cgatcaccgt gatctgcgaa 540 

ctggtcggca tcgccgaagc ggaccgcccg caatggcgga agtggggagc cgatctcgtc 600 

tcgctggagc cggggcggct gagcaccgcg ttcccggcga tggtcgagca catccatgag 660 

ctgatccgcg agcggcgcgg cgcgctcacc gacgatctgc tcagcgagct gatccgcacc 720 

catgacgacg acggcggccg gctcagcgac atcgagatgg tcaccatgat cctcacgatc 780 

gtcctggccg gccacgagac caccgcccac ctcataggca acggcacggc ggcgctgctc 840 

acccaccccg accagctgcg cctactcaag gacgatccgg cgctgctgcc gcgcgccgtc 900 

cacgagctga tgcgctggtg cgggccggtg cacatgaccc agctgcggtt cgcgtccgag 960 

gacgtcgagg tcgccgggac accgatccac aagggcgacg ccgtacaact catcctggta 1020 
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tcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgacct gacccgccac 1080 

cccgccggcc acgccgagaa ccatgtgggc ttcggccacg gaatgcacta ctgcctgggt 1140 

gccaccctcg ccaaacagga aggcgaagtc gccttctccc gcctcttcac gcactacccg 1200 

gaactgtccc tgggcgtcgc ggcggaccag ctggcgcgga cacaggtacc cggcagctgg 1260 

cggctggaca ccctgccgct gcgactgggg tga 1293 



<210> 24 

<211> 430 

<212> PRT 

<213> Streptorriyces tubercidicus 



<400> 24 



Met 


Ser 


Ala 


Leu 


Ser 


Asn 


Ser 


Pro 


T.pii Al a Al a Hiq \7nl CZl\r Lvq Hie 


1 
















10 15 


Pro 


Gly 


Glu 


Pro 


Asn 


Val 


Met 




Pro Al a Tij=»n Tl <=> Thr A<=m Prn PTif=> 

X7X-W ^TlJLCI XJtrLX J — LC XX XX. rlc) J-J IT JL JT -LJ.tr 








20 












Glv 


Glv 


Tvr 


Gly 


Ala 


Leu 


Arrr 


\JJL LX 


fll -n fll \r Prn \Tp* 1 Arrr nl at- Arrr 






j —) 














Phe 


Met 




A q-n 


Del 


Pro 
XT J_ {-J 


V d_L 


Tm 

xip 


T.e^u \7>il Thr Arrr Php Ciln Cln XTal 






















Gin 


Val 


Leu 


Arg 


A^sp 


Gin 


Arg 


php \7n 1 A c?-n A cm Prn Al Al ^ Prn 

xrxxc: vcijl xiDii riolJ. irxu rixci riJLCl JTXU 


65 










70 






/ -J o w 


Ser 


Leu 


Gly 


Aro - 


Ser 


He 


Asp 


Glu 


Qpr Pro Al ^ ^7>5 1 Arrr T\<=>n T.^n Hln 

OCX XTX. rlXCl VuX /TXy XJC2LX UCU VJXU 










O -J 










Met 


Leu 


Gly 


Leu 


Pro 

XT X- W 




Hi 




Am Pro Tvr T.<=>n T ,m i CXl \r cjpr Tl <=» 








100 










105 110 


Leu 


Asn 


Tvr 


Asp 


Ala 


Pro 


Asp 


His 


Thr Arcf Leu Arcr Arrr Leu Val fipr 






115 










120 


125 


ArCT 


Ala 


Phe 


Thr 


Ala 


Arg 


Lys 


He 


Thr Asp Leu Arg Pro Arg Val Glu 




130 










135 




140 


Gin 


He 


Thr 


Asp 


Asp 


Leu 


Leu 


Thr 


Arg Leu Pro Glu His Ala. Glu Asp 


145 










150 






155 160 


Gly 


Val 


Val 


Asp 


Leu 


He 


Gin 


His 


Phe Ala Tyr Pro Leu Pro He Thr 










165 








170 175 


Val 


He 


Cys 


Glu 


Leu 


Val 


Gly 


He 


Ala Glu Ala Asp Arg Pro Gin Trp 








180 










185 190 


Arg 


Lys 


Trp 


Gly 


Ala 


Asp 


Leu 


Val 


Ser Leu Glu Pro Gly Arg Leu Ser 






195 










200 


205 


Thr 


Ala 


Phe 


Pro 


Ala 


Met 


Val 


Glu 


His He His Glu Leu He Arg Glu 




210 










215 




220 


Arg 


Arg 


Gly 


Ala 


Leu 


Thr 


Asp 


Asp 


Leu Leu Ser Glu Leu He Arg Thr 


225 










230 






235 240 


His 


Asp 


Asp 


Asp 


Gly 


Gly 


Arg 


Leu 


Ser Asp He Glu Met Val Thr Met 










245 








250 255 


He 


Leu 


Thr 


lie 


Val 


Leu 


Ala 


Gly 


His Glu Thr Thr Ala His Leu He 








260 










265 270 


Gly 


Asn 


Gly 


Thr 


Ala 


Ala 


Leu 


Leu 


Thr His Pro Asp Gin Leu Arg Leu 






275 










280 


285 


Leu 


Lys 


Asp 


Asp 


Pro 


Ala 


Leu 


Leu 


Pro Arg Ala Val His Glu Leu Met 




290 










295 




300 


Arg 


Trp 


Cys 


Gly 


Pro 


Val 


His 


Met 


Thr Gin Leu Arg Phe Ala Ser Glu 


305 










310 






315 320 


Asp 


Val 


Glu 


Val 


Ala 


Gly 


Thr 


Pro 


He His Lys Gly Asp Ala Val Gin 



-21 - 



WO 02/092801 



PCT/EP02/05363 



325 



Leu He 


Leu 


Val 


Ser Ala Asn Phe 






340 




Asp Arg 


Leu 


Asp 


Leu Thr Arg His 




355 




360 


Val Gly 


Phe 


Gly 


His Gly Met His 


370 






375 


Lys Gin 


Glu 


Gly 


Glu Val Ala Phe 


385 






390 


Glu Leu 


Ser 


Leu 


Gly Val Ala Ala 








405 


Pro Gly 


Ser 


Trp 


Arg Leu Asp Thr 






420 







330 








335 




Asp 


Pro 


Arg His 


Tyr 


Thr 


Asp 


Pro 


345 








350 






Pro 


Ala 


Gly His 


Ala 


Glu 


Asn 


His 








365 








Tyr 


Cys 


Leu Gly 


Ala 


Thr 


Leu 


Ala 






380 










Ser 


Arg 


Leu Phe 


Thr 


His 


Tyr 


Pro 






395 








400 


Asp 


Gin 


Leu Ala 


Arg 


Thr 


Gin 


Val 




410 








415 




Leu 


Pro 


Leu Arg 


Leu 


Gly 






425 








430 







<210> 25 
<211> 1293 
<212> DMA 

<213> Strep toiryces platens is 
<400> 25 

atgtcggcat tatccagctc accgttcgcc gcgcatgtcg ggaaacaccc gggcgagccg 60 

aatgtgatgg acccggcgct gatcgccgat ccgttcggtg gttatggcgc actgcgtgag 120 

caagggccgg tcgtacgggg ccggttcatg gacgactcac ccgtctggct cgtgacgcgc 180 

ttcgaggaag tccgccaagt cctgcgcgac cagcggttcc tgaacgatcc gacggccccc 240 

tccctggggc gctcattcga cgacagcccc acggccaggc tgctggagat gatgggactg 300 

cccgagcatt tccggccgta tctgctcggt tcgattctga acaacgacgc ccccgaccac 360 

acccggctgc gccgtctggt gtcgcgcgcc ttcacggcac gcaagatcac cgacctgcgg 420 

ccgcgggtcg agcagatcgc cgacgagctg ctgacccggc ttcccgagta cgccgaggac 480 

ggcgtggtcg acctcatcaa gcacttcgcc taccccctgc cgatcgccgt catctgcgaa 540 

ctggtcggca tagccgaagc ggatcgtccg cagtggcgga agtggggtgc cgacctcgtc 600 

tcgctgcagc cggaccggct cagcacctcg ttcccggcga tgatcgagca catccatgag 660 

ctgatccgcg agcggcgcgg ggcgctcacg gacgatctgc tcagcgagct gatccgtgcc 720 

catgacgacg acggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacggtg 780 

gtgctcgccg gccacgagac caccgcgcac ctcataggca acggcacggc ggcgctgctc 840 

acccaccccg accagctgcg gctgctcagg gacgacccgg ctctgtttcc ccgtgccgtc 900 

cacgagctgt tgcgctggtg cgggccggtc cacatgaccc agatgcggtt tgcgtccgag 960 

gatgtcgaca tcgccgggac gaagatccgt aagggcgacg ccgtacaact gatcctggta 1020 

tcggccaact tcgacccccg ccactacacc gaccccgaac gtctcgacct gacccgtcac 1080 

cccgccggcc acgccgagaa ccatgtgggc ttcggccacg ggatgcacta ctgcctgggc 1140 

gccaccc teg ccaaacagga gggcgaagtc gcgttcgaga agctcttcgc gcactacccg 1200 

gaggtgtcgc tgggegtege aceggaacaa ctggaaagga caccactgcc eggcagctgg 1260 

oggctcgatt ccctgccgct gcggttgcgg taa 1293 



<210> 26 
<211> 430 
<212> PRT 

<213> Streptomyces platensis 
<400> 26 

Met Ser Ala Leu Ser Ser Ser Pro Phe Ala Ala His Val Gly Lys His 

15 10 15 

Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu He Ala Asp Pro Phe 
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20 


25 


30 


Glv 


Glv 


Tvr 


Glv Al a Leu Ar cr 


Glu Gin Glv Pro 


Val Val Arrr Glv Arrr 

v cij_ v cut, fix y \jTX-_y nx y 






35 




40 


45 


Phe 


Met 


Asp 


Asra Ser Pro Val 


Tro Leu Val Thr 

JL JL- f-^ -1—1 l^C V L4J. JL X ' ' 


Arcr Phe Gin Gin Val 

JTXX. y XT XX VIS VJJ.U. v_T_l_ vX V Cl J_ 




50 




55 




60 


Arcr 

XXL. y 


Gin 


Val 


T ,on Arcr A^T) Gl n 


Atct Phe Tien A<=tn 

«ly JLXX^ X_IC?IX £xi~D± X 


J\erj pyn Thr Al a Pro 


65 






70 


75 


80 


Ser 


Leu 


Glv 


Arcr Ser Phe Astd 

i ■ km * V \_~_L_ U. X XV^ JTl»-V KmS 


Asd S er Pro Th r 


Ala Arrr Tif^n Tif=*n Gin 
ri — l cx nx y ucu xxc; lx vjt_l. lj. 








85 


90 


95 


Met 


Met 


Glv 


Leu Pro Glu His 


Phe Arcr Pro Tvr 

X. J. XV-- ili, JL J_ v_y JL _y JL. 


T ,pn Tif^n Gl \7 ^pr Tl 

J ICS l_X IJCU. VJ_LJy O Ci- J — LC 








100 


105 


110 


Leu 


Asn 


Asn 


Art) Al a Pro Art) 


Hi s Thr Arcr T.e^n 

ij.x>j juxjlx. nxy J — i\ . ljl 


Arrr Arrr Tif^n \T^i 1 Q/^-v- 
xix y jtxx. y ucu. vcij- *_>"x_ 






115 




120 


125 


Arg 


Ala 


Phe 


Thr Ala Arg Lys 


Tl e Thr As*o Tif=>n 


Arcr Pro Arrr Va 1 Gl 1 1 

rix y xx rix y veil uiu. 




130 




135 




140 


Gin 


lie 


Ala 


Art - ) Gli 3 Lori Tieii 


Thy* Arrr T i Pt~o 

XX XX. jTiJL, y JJCU. X JL L> 


C2.~] -i i T\tt^ Al a. CX~\ ~\ i Aero 
vjjx lx lyx jri-Lci vjtjlu. xt.»d^j 


145 






150 


J- -*J —} 


j- \j \j 




Val 


Val 




H"i •=? Plip Al a T\/r~ 


Pro T m n Pvo Tl n A "1 
x -L- \J J_jtr L4. xrX (J lie jr\J.d 








165 


1 70 




Val 


lie 




Gin T.ou Val Glv 

\_TX Li J IC LX VClJ. VjTX_Y 


Tie Ala Ql n Ala 

XXC fl_Lw VJJLLi. rU-d 










180 


185 




.«x y 


-LJ_Y C3 




fll \/ Ala A <=«r^ T .01 i 


A/al Q^-v- t iP1 cn-r\ 

VCJ.JL iJCl JLltSU. VjJ.ll 


irx A^jky /ix y lieu oer 






i_? _> 




V/ VJ 


ZUj 


Tin 

1111- 


Pel. 


p~ho 

IT lit; 


Pro Al ^ Mot" Tl 


HI n Hiq Tip u-Jc 

VjT-LU. nib J — Ltji n_L 0 


Pin T ^1 1 Tl q A ~r~r~r Pin 

\jx Li jjeu lie /iiy biu 




210 








990 


Arcr 


ArO 
«cxi_ y 




Ala T,on TTrr 2\Qn 


7\ en T ,e^m T.on Cor 
nop ixtTLl UcU Ofc^X. 


UlU jUtiLl 11c fixy r^LLcl 


995 

/Li ~J 






9^0 

~> w 






XIX O 






Ac^-o CX~\ at- f21 at- Arrr 

jrii^Ly VjjXJy V7X_y x-iXy 


jUfciU. Del Val 


VyjlU. I v xt3 U Veil J. Ill IXLGL. 








245 


250 


OCR 
Z J J 


lie 


Leu 


Thr 


Val Val T,<=n Ala 

VdJL V CXX ijCU AXQ 


V3_L_y niiD VU-LLJ. 11J-L 


TPTtt" A 1 =5 TJ-5 c» T i Tlo 
jLIJI rild. xllo xxfcrLi 11c 








260 


265 


970 


Gly 


Asn 


Gly 


Thr Ala Ala T,e>u 

JU X XX AJ. CI ilJ. C*. XX LX 


T.*=ai Thr* "H"-i c? Pro 

iJCU JLlJLL. XIJ-O X J_ W 


A C?T^ n.1 T~i T i A •Y m / m r T .01 T 

r^*3^ VjtIXx J—it^Lx riy if till 






275 




280 




Leu 


Arcr 


Asp 


Aqn Pro Al a 7\on 


Phf=> Prn Arrr Ala 
nic xx l/ Jr\j- y rix cl 


T/al Ri <=! Gin T.on T.P»n 

VCll Illo VJJl U. XltJU lifer Ll 




290 




295 




^00 


Arcr 


Tr~D 


Cys 


Glv Pro Val His 


Mot* Thr ffl n Mot" 

llv^ >_ JL XXJL V_T_LX X HvIk 


Arrr Pino Al ^ot* f-iT n 

rix y Jrllfer Ax ex. OtSl VjlU 


305 






310 


315 


320 


Asp 


Val 


Asp 


Tie Ala Glv Thr 

-< — ex. v_JX. y XX XX 


T,vc! Tl 0 Attt Ta/"c? 
xj_y j — l rU- y jui_y 0 


nlv Aqo Al ^ TZ^ 1 Pin 
oi_y 211a veil vjih 








325 


330 


335 


Leu 


lie 


Leu 


Val SeT Al a Ann 

v v-*- -x_ N — »i- xtlxm. noil 


pV»o Ac-n Pro Arrr 

-LTXXV^ JTiiZJJ'O' XT X- W ill y 


T-Ti <=i Tvr Thr A<=ro Pro 

xxxo i_yx x xxx irlojky xxw 








340 


345 


350 


Glu 


Arcr 


Leu 


Asp Leu Thr Arg 


His Pro Ala Glv 

X X J- PsJ X. -X- V-/ XTA ■ I. L-< VJ _L, 


His Ala Gin Asn His 






355 




360 


365 


Val 


Glv 

\_TXJf 


Phe 


Gly His Gly Met 


His Tvr rVq T.en 


Glv Ala Thr T.^n Ala 

vjjx_y fix a. x xxx ucu ax ex 




370 




375 




380 


Lys 


Gin 


Glu 


Gly Glu Val Ala 


Phe Glu Lys Leu 


Phe Ala His Tyr Pro 


385 






390 


395 


400 


Glu 


Val 


Ser 


Leu Gly Val Ala 


Pro Glu Gin Leu 


Glu Arg Thr Pro Leu 








405 


410 


415 


Pro 


Gly 


Ser 


Trp Arg Leu Asp 


Ser Leu Pro Leu 


Arg Leu Arg 








420 


425 


430 



<210> 27 



-23- 



WO 02/092801 



PCT/EP02/05363 



<211> 1293 
<212> DNA 

<213> Streptomyces platensis 
<400> 27 

atgtcggcat tatccagctc tccgttcgct gcgcatgtcg ggaaacaccc gggtgagccg 60 

aatgtgatgg agccggcgct gctcaccgac ccgttcgcgg gctacggcgc gctgcgtgag 120 

caggccccgg tcgtacgggg ccggttcgtg gacgactcac cggtctggtt cgtgacgcgc 180 

ttcgaggagg tccgccaagt cctgcgcgac cagcggttcg tgaacaatcc ggccgcgccg 240 

cccctggccc caitcggccga ggagaacccg ctgaccaggc tgatggacat gctgggcctc 300 

cccgagcacc tccgcgtcta catgctcggg tcgattctca actacgacgc ccccgaccac 360 

acccggctgc gccgtctggt gtcgcgcgcg ttcacggcgc ggaagatcac cgatctgcga 420 

ccgcgtgtcg agcagatcgc cgacgagctg ctggcccgcc tccccgagta cgccgaggac 480 

ggcgtcgtcg acctcatcca gcatttcgcc tacccgctgc cgatcaccgt catctgcgag 540 

ctggtcggca tacccgaagc ggaccgcccg cagtggcgga agtggggcgc cgacctcatc 600 

tcgatggacc cggaccggct cggcgcaacg ttcccggcga tgatcgagca catccatgag 660 

atggtccggg agcggcgcgc ggcgctcacc gatgatctgc tcagcgagct gatccgtacc 720 

catgacgacg atggcggccg gctcagcgac gtcgagatgg tcaccatgat cctcacgctc 780 

gtcctcgccg gtcacgagac caccgcccac ctcatcagca acggcacggc ggcgctgctc 840 

acccaccccg accagctgcg cctgctcaag gacgacccgg ccctgctccc ccgggccgtc 900 

catgagctga tgcgctggtg cgggccggtg cagatgacgc agctgcgcta cgcggccgcc 960 

gacgtcgacc tcgccggtac gcggatccac aagggcgacg ccgtacaact cctcctggtt 1020 

gcggcgaact tcgacccccg ccactacacc gaccccgacc gtctcgatct gacgcgtcac 1080 

cccgccggcc acgccgagaa cca tgtgggt ttcggccacg gtgcgcatta ctgcctgggt 1140 

gccaccctcg ccaagcagga gggcgaagtc gcgttcggca agctgctcgc gcactacccg 1200 

gagatgtccc tgggcatcga accggaacgt ctggagcgat tgccgctgcc tggcaactgg 1260 

cggctgaatt ccctgccgtt gcggctgggg tga 1293 



<210> 28 

<211> 430 

<212> PRT 

<213> Streptomyces platensis 



<400> 28 



Met 


Ser 


Ala 


Leu 


Ser 


Ser 


Ser 


Pro 


Phe 


Ala Ala His Val Gly Lys His 


1 








5 










10 15 


Pro 


Gly 


Glu 


Pro 


Asn 


Val 


Met 


Glu 


Pro 


Ala Leu Leu Thr Asp Pro Phe 








20 










25 


30 


Ala 


Gly 


Tyr 


Gly 


Ala 


Leu 


Arg 


Glu 


Gin 


Ala Pro Val Val Arg Gly Arg 






35 










40 




45 


Phe 


Val 


Asp 


Asp 


Ser 


Pro 


Val 


Trp 


Phe 


Val Thr Arg Phe Glu Glu Val 




50 










55 






60 


Arg 


Gin 


Val 


Leu 


Arg 


Asp 


Gin 


Arg 


Phe 


Val Asn Asn Pro Ala Ala Pro 


65 










70 








75 80 


Pro 


Leu 


Ala 


Pro 


Ser 


Ala 


Glu 


Glu 


Asn 


Pro Leu Thr Arg Leu Met Asp 










85 










90 95 


Met 


Leu 


Gly 


Leu 


Pro 


Glu 


His 


Leu 


Arg 


Val Tyr Met Leu Gly Ser lie 








100 










105 


110 


Leu 


Asn 


Tyr 


Asp 


Ala 


Pro 


Asp 


His 


Thr 


Arg Leu Arg Arg Leu Val Ser 






115 










120 




125 


Arg 


Ala 


Phe 


Thr 


Ala 


Arg 


Lys 


lie 


Thr 


Asp Leu Arg Pro Arg Val Glu 




130 










135 






140 


Gin 


lie 


Ala 


Asp 


Glu 


Leu 


Leu 


Ala 


Arg 


Leu Pro Glu Tyr Ala Glu Asp 
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145 








150 


Gly 


Val 


Val 


Asp 


Leu He Gin His 










165 


Val 


lie 


Cys 


Glu 


Leu Val Gly He 








180 




Arg 


Lvs 


Trp 


Glv 


Ala Asp Leu He 






195 




200 


Ala 


Thr 


Phe 


Pro 


Ala Met He Glu 




210 






215 


Arg 


Arcr 


Ala 


Ala 


Leu Thr Asp Asp 


225 








230 


His 


Asp 


Asp 


Asp 


Gly Gly Arg Leu 










245 


lie 


Leu 


Thr 


Leu 


Val Leu Ala Glv 








260 




Ser 


Asn 


Glv 


Thr 


Al a Al a Leu Leu 






275 




280 


Leu 


Lys 


Asp 


Asp 


Pt"o Al a Len Leu 




290 






295 


At~ct 


Tro 




Glv 


Pro Val Gin Met" 

X7X. W V CL_L OT-LJ.X 11G V— 


305 








310 


Asp 


Val 






Ala f4l \r fpVir A-yrr 
ilia. vj7_L_y l .r-xx. y 










325 


Leu 


Leu 


Leu 


Val 


Ala Ala Ac?-n P7~i*=» 
jrvj.ca. ^-i_i_cl ri&ii. xrxxc; 








340 




Asp 


Arg 


Leu 


Asp 








355 




360 


Val 


Gly 


Phe 


Gly 


His Gly Ala His 




370 






375 


Lys 


Gin 


Glu 


Gly 


Glu Val Ala Phe 


385 








390 


Glu 


Met 


Ser 


Leu 


Gly He Glu Pro 










405 


Pro 


Gly 


Asn 


Trp 


Arg Leu Asn Ser 



420 





155 










J_ U \J 


Phe 


Ala Tyr 


JT -I- 


Leu 


XT X. 


lie 


X -LXX. 




170 








175 




Pro 


Glu Ala 


nop 


.nx y 


XT X. <J 


Gin 


X -1- JvJ 


185 








1 90 

Z> \J 






Qpr 




Pro 
jr j_ 




r^x. y 


Leu 


ril v 
















His 


Tip Hi «=; 

XX J- O 


Gin 




Val 

V CIX. 


Arg 


vJJ-L U. 






220 










Leu 


T.pi] Cpi-p- 


Glu 


Leu 


Tie 


A T~rr 


X 1XJ_ 




Zj -J 














A c?T^k X 7=3 1 


a~\ n 

VJ-L IX 


xut. t— 


Val 


X XXJL 


X v lfc: 




250 








255 




His 


Glu Thr 


Thr 


Al a 

L CI 


"Hi cj 

XX_LO 


XJtiU. 


Tl & 
J_ _L tr 


265 








970 






Thy 


TTn o Pro 






XlCU 




XJfci LX 
















p-rn 


Avvt Al 


Val 
























1 IlX. 




7\ -y-rr 


j.yr 


Al a 


Al ^ 


Al ^ 
















Tl o 

J — LtS 


His Lys 


ml -\7 


nop 


Al a 


val 


UXli 




J JU 












Asp 


rxu fiiy 


XJ*T C2 

nxS 


-tyr 


XX IX 




Pro 
















-trX O 


Al a fl a/ - 


xl_Lo 


Al ^ 


VJ-L U. 


Asn 


XiJ_i5 








JUJ 








Tyr 


Cys Leu 


tj-Ly 


A±a 


lnr 


Laeu 


A±a 






380 










Gly 


Lys Leu 


Leu 


Ala 


His 


Tyr 


Pro 




395 










400 


Glu 


Arg Leu 


Glu 


Arg 


Leu 


Pro 


Leu 




410 








415 




Leu 


Pro Leu 


Arg 


Leu 


Gly 






425 








430 







<210> 29 
<211> 1293 
<212> DNA 

<213> Streptomyces lydicus 
<400> 29 

atgtcggcat tacccagcaa cacgttcacc gagcaegteg gcaagcaccc gggcgaaccg 60 

aacgtgatgg atccggcgct gateggtgat ccgttcgccg gttaeggege getgegegag 120 

c agggcccgg tcgtgcgggg gcggttcgtg gacgactccc ccgtgtggtt cgtgacccgc 180 

ttcgaggagg tecgegaggt cctgcgggac cagcggttcc ggaacaatcc ggtctcctcg 240 

gcgccggacg cggaccccga ggacaccccg ctgtcccggc tgatggacat gatgggtttc 300 

cccgagcacc tgegegtcta tctgctcggc tcgatcctca acaacgacgc ccccgaccac 360 

acccggctgc gccgcctggt ctcccgggcc ttcaccgcgc ggaagatcac egatctgegg 420 

ccgcgcgtcg cacagatagc cgacgagctg ctggcccggc tgeeggagea cgccgaggac 480 

ggegtegteg acctgatcca gcacttcgcc tatcccctgc cgatcaccgt catctgegaa 540 

ctggtcggca tccccgagga ggaccgcccg cagtggcgca cctggggcgc cgacctggtc 600 

tcgctgcagc eggaceggat gagccggtcc ttcccggcga tgatcgacca catccacgag 660 



-25- 



WO 02/092801 



PCT/EP02/05363 



ctgatcgcgg cgcggcgccg ggcgctcacc gacgacctgc tcagcgagct gatccgaacc 720 

catgacgacg acggcagccg gctcagcgac gtcgagatgg tcaccatggt cctcaccgtc 780 

gtcctggccg gccacgagac caccgcgcac ctcatcggca acggcacggc ggccctgctc 840 

acccaccccg accagctgcg gctgctcaag gacgacccgg cgctgctgcc gcgcgcggtg 900 

cacgagttga tgcgctggtg cggcccggtg cacatgaccc agctgcgcta cgccgccgag 960 

gacgtcgagc tggcgggcgt ccggatccgc aagggggacg ccgtccagct catcctggtg 1020 

tcggcgaacc gcgatccgcg ccactacacc gaacccgacc gtctggacct gacccggcac 1080 

cccgccggcc acgccgagaa ccatgtgggg ttcggccacg gggcgcacta ctgtctgggc 1140 

gccacgctcg ccaagcagga gggcgaggtc gccctcggcg ccctgctcag gcacttcccc 1200 

gagctgtcgc tggccgtcgc gccggacgcc ctggagcgca caccggtacc gggcagctgg 1260 

cggctgaatg cgctgccgct gcgtctgcgc tga 1293 



<210> 30 

<211> 430 

<212> PRT 

<213> Streptoinyces lydicus 



<400> 30 



M(=»t 




rU-d 




Pro 


Ocl 








Thr* (Tin TJ-i c» \Ta~] Cll\r Ta/q T-Tn ci 
±iul vjJLUL nxiD vet J. vj_L_y jjyb ni o 


-L 


















10 1 S 

X W J > 


PrO 


Gly 


Glu 


Pro 


Asn 


Val 


Met 




Pro 


lii-CJ. -l—Jt; LJ_ J_„l_ci VjjJ_j^ Jri.oJ^/ rlU IT±J.t2 








90 
^-i \j 










25 


30 


Ala 


Glv 


Tvr 


Glv 


Ala 


Leu 


A TO" 


Glu 


Gin 


Glv Pro Val Val Arcr Glv Arcr 






35 










40 




45 


Phe 


Val 


Asp 


Asp 


Ser 


Pro 


Val 


Trr> 


Phe 


Val Thr Arcr Phe Glu Glu Val 




50 










55 






60 


Arg 


Glu 


Val 


Leu 


Arg 


Asp 


Gin 


Arg 


Phe 


Arg Asn Asn Pro Val Ser Ser 


65 










70 








75 80 


Ala 


Pro 


Asp 


Ala 


Asp 


Pro 


Glu 


Asp 


Thr 


Pro Leu Ser Arg Leu Met Asp 










85 










90 95 


Met 


Met 


Gly 


Phe 


Pro 


Glu 


His 


Leu 


Arg 


Val Tyr Leu Leu Gly Ser He 








100 










105 


110 


Leu 


Asn 


Asn 


Asp 


Ala 


Pro 


Asp 


His 


Thr 


Arg Leu Arg Arg Leu Val Ser 






115 










120 




125 


Arg 


Ala 


Phe 


Thr 


Ala 


Arg 


Lys 


He 


Thr 


Asp Leu Arg Pro Arg Val Ala 




130 










135 






140 


Gin 


lie 


Ala 


Asp 


Glu 


Leu 


Leu 


Ala 


Arg 


Leu Pro Glu His Ala Glu Asp 


145 










150 








155 160 


Gly 


Val 


Val 


Asp 


Leu 


He 


Gin 


His 


Phe 


Ala Tyr Pro Leu Pro He Thr 










165 










170 175 


Val 


lie 


Cys 


Glu 


Leu 


Val 


Gly 


lie 


Pro 


Glu Glu Asp Arg Pro Gin Trp 








180 










185 


190 


Arg 


Thr 


Trp 


Gly 


Ala 


Asp 


Leu 


Val 


Ser 


Leu Gin Pro Asp Arg Met Ser 






195 










200 




205 


Arg 


Ser 


Phe 


Pro 


Ala 


Met 


He 


Asp 


His 


He His Glu Leu He Ala Ala 




210 










215 






220 


Arg 


Arg 


Arg 


Ala 


Leu 


Thr 


Asp 


Asp 


Leu 


Leu Ser Glu Leu He Arg Thr 


225 










230 








235 240 


His 


Asp 


Asp 


Asp 


Gly 


Ser 


Arg 


Leu 


Ser 


Asp Val Glu Met Val Thr Met 










245 










250 255 


Val 


Leu 


Thr 


Val 


Val 


Leu 


Ala 


Gly 


His 


Glu Thr Thr Ala His Leu He 








260 










265 


270 


Gly 


Asn 


Gly 


Thr 


Ala 


Ala 


Leu 


Leu 


Thr 


His Pro Asp Gin Leu Arg Leu 
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275 










280 








285 








Leu 


Lys 
290 


Asp 


Asp 


Pro 


Ala 


Leu 

295 


Leu 


Pro 


Arg 


Ala Val 
300 


His 


Glu 


Leu 


Met 


Arg 


Trp 


Cys 


Gly 


Pro 


Val 


His 


Met 


Thr 


Gin 


Leu Arg 


Tyr 


Ala 


Ala 


Glu 


305 










310 










315 








320 


Asp 


Val 


Glu 


Leu 


Ala 
325 


Gly 


Val 


Arg 


lie 


Arg 
330 


Lys Gly 


Asp 


Ala 


Val 
335 


Gin 


Leu 


lie 


Leu 


Val 
340 


Ser 


Ala 


Asn 


Arg 


Asp 
345 


Pro 


Arg His 


Tyr 


Thr 
350 


Glu 


Pro 


Asp 


Arg 


Leu 
355 


Asp 


Leu 


Thr 


Arg 


His 
360 


Pro 


Ala 


Gly His 


Ala 
365 


Glu 


Asn 


His 


Val 


Gly 
370 


Phe 


Gly 


His 


Gly 


Ala 
375 


His 


Tyr 


Cys 


Leu Gly 
380 


Ala 


Thr 


Leu 


Ala 


Lys 


Gin 


Glu 


Gly 


Glu 


Val 


Ala 


Leu 


Gly 


Ala 


Leu Leu 


Arg 


His 


Phe 


Pro 


385 










390 










395 








400 


Glu 


Leu 


Ser 


Leu 


Ala 
405 


Val 


Ala 


Pro 


Asp 


Ala 
410 


Leu Glu 


Arg 


Thr 


Pro 
415 


Val 


Pro 


Gly 


Ser 


Trp 

420 


Arg 


Leu 


Asn 


Ala 


Leu 
425 


Pro 


Leu Arg 


Leu 


Arg 
430 







<210> 31 
<211> 1293 
<212> DNA 

<213> Streptoinyces lydicus 
<400> 31 

atgtcggcat cgaccagctc 
catgtgatgg atccggcgct 
cagggaccgg tcgtccgcgg 
ttcgaggaag tccgccaggt 
gcgctcggcg tcgcgccgga 
cccgagcacc tgcacggcta 
acccggctgc gccgcctggt 
ccgcgggtgg cgcagataac 
ggcgtggtcg acctgatcga 
cttgtcggca tcgccgcgga 
tcggtggacc ccgaccggct 
ctgatcggcc agcggcgggc 
catgacgacg acggcagccg 
gtgctggccg gccacgagac 
acccaccccg accagctgcg 
cacgagctga tgcgctggtg 
gacgtcgacc tcgccggcac 
tcggcgaacc acgacccgcg 
cccgcgggcc gcgccgagaa 
gccagcctcg ccaggcagga 
gacctggcgc tggcggtggc 
cggctgacgt cgctgccggt 



tcccctcagc gcccacgtcg gcaagcaccc gggcgaaccc 60 

gatcagcgat ccgttcggcg gctacggtgc cctgcgcgag 120 

acggttcttc gacgactcgc ccttgtggtt agtgacccgc 180 

cctgcgcgac cagcggttcg tgaacaaccc cgccgacccg 240 

ggactccccg cagctgcgcg cgctggcgat gctgggcatc 300 

tctgctcaac tcgatcctca actacgacgc ccccgaccac 360 

ctcccgcgcc ttcaccgccc gcaagatcac cgatcttcgg 420 

cgccgagctg ctggaccgac tcccggagca cgccgaggac 480 

gcacttcgcc tacccgctgc cgatcacggt gatctgcgaa 540 

ggaccggccc cagtggcgtt cctggggcgc cgacctggtc 600 

cggccggacc ttcccggcga tgatcgacca catccacgcg 660 

cgcgctca.cc gacgacctgc tcagcgagct gatccggacc 720 

gctcagcgac gtcgagatgg tcaccctggt cctcaccctc 780 

caccgcacac ctcatcggca acggcaccgc ggccctgctc 840 

gctgctcaag gacgacccgg cgctgctgcc gcgcgccgtc 900 

cgggccggtg cacgtcaccc agctgcggta cgccgccgag 960 

ccggatccgc aggggcgacg ccgtgcaggc cgtcctggtc 1020 

ccactacacc gaccccgaac gcctggacct gacccggcag 1080 

ccacgtgggc ttcgggcacg gggcgcacta ctgcctgggc 1140 

gggtgaggtc gccctgggcg ccctgttcga ccgctacccc 1200 

gcccgaggag ctggagcgca ccccggtgcc cggtacctgg 1260 

gcgcctgggc tga 1293 



<210> 32 

<211> 430 

<212> PRT 

<213> Streptoirr^ces lydicus 
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<400> 32 

Met Ser Ala Ser Thr Ser Ser Pro Leu Ser Ala His Val Gly Lys His 

15 10 15 

Pro Gly Glu Pro His Val Met Asp Pro Ala Leu lie Ser Asp Pro Phe 

20 25 30 

Gly Gly Tyr Gly Ala Leu Arg Glu Gin Gly Pro Val Val Arg Gly Arg 

35 40 45 

Phe Phe Asp Asp Ser Pro Leu Trp Leu Val Thr Arg Phe Glu Glu Val 

50 55 60 

Arg Gin Val Leu Arg Asp Gin Arg Phe Val Asn Asn Pro Ala Asp Pro 
65 70 75 80 

Ala Leu Gly Val Ala Pro Glu Asp Ser Pro Gin Leu Arg Ala Leu Ala 

85 90 95 

Met Leu Gly lie Pro Glu His Leu His Gly Tyr Leu Leu Asn Ser lie 

100 105 110 

Leu Asn Tyr Asp Ala Pro Asp His Thr Arg Leu Arg Arg Leu Val Ser 

115 120 125 

Arg Ala Phe Thr Ala Arg Lys He Thr Asp Leu Arg Pro Arg Val Ala 

130 135 140 

Gin He Thr Ala Glu Leu Leu Asp Arg Leu Pro Glu His Ala Glu Asp 
145 150 155 160 

Gly Val Val Asp Leu He Glu His Phe Ala Tyr Pro Leu Pro He Thr 

165 170 175 

Val He Cys Glu Leu Val Gly He Ala Ala Glu Asp Arg Pro Gin Trp 

180 185 190 

Arg Ser Trp Gly Ala Asp Leu Val Ser Val Asp Pro Asp Arg Leu Gly 

195 200 205 

Arg Thr Phe Pro Ala Met He Asp His He His Ala Leu He Gly Gin 

210 215 220 

Arg Arg Ala Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu He Arg Thr 
225 230 235 240 

His Asp Asp Asp Gly Ser Arg Leu Ser Asp Val Glu Met Val Thr Leu 

245 250 255 

Val Leu Thr Leu Val Leu Ala Gly His Glu Thr Thr Ala His Leu He 

260 265 270 

Gly Asn Gly Thr Ala Ala Leu Leu Thr His Pro Asp Gin Leu Arg Leu 

275 280 285 

Leu Lys Asp Asp Pro Ala Leu Leu Pro Arg Ala Val His Glu Leu Met 

290 295 300 

Arg Trp Cys Gly Pro Val His Val Thr Gin Leu Arg Tyr Ala Ala Glu 
305 310 315 320 

Asp Val Asp Leu Ala Gly Thr Arg He Arg Arg Gly Asp Ala Val Gin 

325 330 335 

Ala Val Leu Val Ser Ala Asn His Asp Pro Arg His Tyr Thr Asp Pro 

340 345 " 350 

Glu Arg Leu Asp Leu Thr Arg Gin Pro Ala Gly Arg Ala Glu Asn His 

355 360 365 

Val Gly Phe Gly His Gly Ala His Tyr Cys Leu Gly Ala Ser Leu Ala 

370 375 380 

Arg Gin Glu Gly Glu Val Ala Leu Gly Ala Leu Phe Asp Arg Tyr Pro 
385 390 395 400 

Asp Leu Ala Leu Ala Val Ala Pro Glu Glu Leu Glu Arg Thr Pro Val 
405 410 415 
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Pro Gly Thr Trp Arg Leu Thr Ser Leu Pro Val Arg Leu Gly 
420 425 430 



<210> 33 
<211> 1281 
<212> DNA 

<213> Streptomyces tubercidicus 
<400> 33 

atgaactctc cgttcgccgc gcacgtcggg aaacacccgg gcgagccgaa tgtgatggac 60 

cccgccctga tcaccgaccc gttcaccggc tacggcgcgc tgcgtgagca gggcccggtc 120 

gtacggggcc ggttcatgga cgactcgccc gtctggctgg tgacgcggtt cgaggaggtc 180 

cgccaggtcc tgcgcgacca gcggttcgtg aacaatccgg cctcgccgtc cctgaactac 240 

gcgcccgagg acaacccgct gacccggctg atggagatgc tgggcctccc cgagcacctc 300 

cgcgtctacc tgctcggatc gatcctcaac tacgacgccc ccgaccacac ccggctgcgc 360 

cgtctggtgt cgcgggcgtt cacggcccgc aagatcaccg acctgcggcc ccgggtcgag 420 

cagatcgccg acgcgctgct ggcccggctg cccgagcacg ccgaggacgg cgtcgtcgac 480 

ctcatccagc acttcgccta ccccctgccg atcaccgtca tctgcgaact ggtcggcata 540 

cccgaagcgg accgcccgca gtggcgaacg tggggcgccg acctcatctc gatggatccg 600 

gaccggctcg gcgcctcgtt cccggcgatg atcgagcaca tccatcagat ggtccgggaa 660 

cggcgcgagg cgctcaccga cgacctgctc agcgaactga tccgcaccca tgacgacgac 720 

ggcgggcggc tcagcgacgt cgagatggtc accatgatcc tcacgctcgt cctcgccggc 780 

cacgagacca ccgcccacct catcagcaac ggcacggcgg cgctgctcac ccaccccgac 840 

cagctgcgtc tggtcaagga cgatccggcc ctcctccccc gtgccgtcca cgagctgatg 900 

cgctggtgcg ggccggtgca catgacccag ctgcgctacg ccaccgccga cgtcgacctc 960 

gccggcacac cgatccgcca gggcgatgcc gttcaactca tcctggtatc ggccaacttc 1020 

gacccccgtc actacaccga ccccgaccgc ctcgatctca cccggcaccc cgcgggccac 1080 

gccgagaacc atgtgggttt cggccatgga gcgcactact gcctgggcgc cacactcgcc 1140 

aaacaggaag gtgaagtcgc cttcggcaaa ctgctcacgc actacccgga catatcgctg 1200 

ggcatcgccc cggaacacct ggagcggaca ccgctgccgg gcaactggcg gctgaactcg 1260 

ctgccggtgc ggttggggtg a 1281 

<210> 34 
<211> 426 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 34 

Met Asn Ser Pro Phe Ala Ala His Val Gly Lys His Pro Gly Glu Pro 

15 10 15 

Asn Val Met Asp Pro Ala Leu lie Thr Asp Pro Phe Thr Gly Tyr Gly 

20 25 30 

Ala Leu Arg Glu Gin Gly Pro Val Val Arg Gly Arg Phe Met Asp Asp 

35 40 45 

Ser Pro Val Trp Leu Val Thr Arg Phe Glu Glu Val Arg Gin Val Leu 

50 55 60 

Arg Asp Gin Arg Phe Val Asn Asn Pro Ala Ser Pro Ser Leu Asn Tyr 
65 70 75 80 

Ala Pro Glu Asp Asn Pro Leu Thr Arg Leu Met Glu Met Leu Gly Leu 

85 90 95 

Pro Glu His Leu Arg Val Tyr Leu Leu Gly Ser lie Leu Asn Tyr Asp 
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100 



Ala 


Pro 


Asp 


His 


Thr 


Arg 


Leu 


Arg 






115 










120 


Ala 


Arg 


Lys 


He 


Thr 


Asp 


Leu 


Arg 




130 










135 




Ala 


Leu 


Leu 


Ala 


Arg 


Leu 


Pro 


Glu 


145 










150 






Leu 


He 


Gin 


His 


Phe 


Ala 


Tyr 


Pro 










165 








Leu 


Val 


Gly 


He 


Pro 


Glu 


Ala 


Asp 








180 










Ala 


Asp 


Leu 


lie 


Ser 


Met 


Asp 


Pro 






195 










200 


Ala 


Met 


He 


Glu 


His 


He 


His 


Gin 




210 










215 




Leu 


Thr 


Asp 


Asp 


Leu 


Leu 


Ser 


Glu 


225 










230 






Glv 


Glv 


Arg 


Leu 


Ser 


Asp 


Val 


Glu 










245 








Val 


Leu 


Ala 


Gly 


His 


Glu 


Thr 


Thr 








260 










Ala 


Ala 


Leu 


Leu 


Thr 


His 


Pro 


Asp 






275 










280 


Pro 


Ala 


Leu 


Leu 


Pro 


Arg 


Ala 


Val 




290 










295 




Pro 


Val 


His 


Met 


Thr 


Gin 


Leu 


Arg 


305 










310 






Ala 


Gly 


Thr 


Pro 


He 


Arg 


Gin 


Gly 










325 








Ser 


Ala 


Asxi 


Phe 


Asp 


Pro 


Arg 


His 








340 










Leu 


Thr 


Arg 


His 


Pro 


Ala 


Glv 


His 






355 










360 


His 


Gly 


Ala 


His 


Tyr 


Cys 


Leu 


Gly 




370 










375 




Glu 


Val 


Ala 


Phe 


Gly 


Lys 


Leu 


Leu 


385 










390 






Gly 


He 


Ala 


Pro 


Glu 


His 


Leu 


Glu 










405 








Arg 


Leu 


Asn 


Ser 


Leu 


Pro 


Val 


Arg 



420 



105 










110 






Arcr 


Leu 


Val 


Ser 


Arg 


Ala 


Phe 


Thr 










125 








Pro 


Arcr 


Val 


Glu 


Gin 


He 


Ala 


Asp 








140 










His 


Ala 


Glu 


Asp 


Gly 


Val 


Val 


Asp 






155 










' 160 


Leu 


Pro 


He 


Thr 


Val 


He 


Cys 


Glu 




170 










175 




Arg 


Pro 


Gin 


Trp 


Arg 


Thr 


Trp 


Gly 


185 










190 






Asp 


Arcr 


Leu 


Gly 


Ala 


Ser 


Phe 


Pro 










205 








Met 


Val 


Arcr 


Glu 


Arg 


Arg 


Glu 


Ala 








220 










Leu 


He 


Arcr 


Thr 


His 


Asp 


Asp 


Asp 






235 










240 


Met 


Val 


Thr 


Met 


He 


Leu 


Thr 


Leu 




250 










255 




Ala 


His 


Leu 


He 


Ser 


Asn 


Gly 


Thr 


265 










270 






Gin 


Leu 


Arg 


Leu 


Val 


Lys 


Asp 


Asp 










285 








His 


Glu 


Leu 


Met 


Arg 


Tro 


Cys 


Glv 








300 










Tvr 


Ala 


Thr 


Ala 


Asp 


Val 


Asp 


Leu 






315 










320 


Asp 


Ala 


Val 


Gin 


Leu 


He 


Leu 


Val 




330 










335 




Tvr 


Thr 


Asp 


Pro 


Asp 


Arg 


Leu 


Asp 


345 










350 






Ala 


Glu 


Asn 


His 


Val 


Glv 


Phe 


Glv 










365 








Ala 


Thr 


Leu 


Ala 


Lys 


Gin 


Glu 


Gly 








380 










Thr 


His 


Tyr 


Pro 


Asp 


He 


Ser 


Leu 






395 










400 


Arg 


Thr 


Pro 


Leu 


Pro 


Gly 


Asn 


Trp 




410 










415 





Leu Gly 
425 



<210> 35 
<211> 195 
<212> DNA 

<213> Strep tomyces tuber cidi cus 
<400> 35 

atgcggatca cgatcgacac cgacatctgt atcggcgccg gccagtgcgc cctgaccgcg ■ 60 
ccgggagtgt tcacccagga cgacgacggc ttcagcgccc tgctgcccgg ccgcgaggac 120 
ggtgcgggcg acccgctggt gcgggaggcc gcccgcgcct gcccggtgca ggccatcacg 180 
gtcacggacg actga 195 



-30- 
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<210> 36 
<211> 64 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 36 

Met Arg lie Thr He Asp Thr Asp He Cys He Gly Ala Gly Gin Cys 

15 10 15 

Ala Leu Thr Ala Pro Gly Val Phe Thr Gin Asp Asp Asp Gly Phe Ser 

20 25 30 

Ala Leu Leu Pro Gly Arg Glu Asp Gly Ala Gly Asp Pro Leu Val Arg 

35 40 45 

Glu Ala Ala Arg Ala Cys Pro Val Gin Ala He Thr Val Thr Asp Asp 
50 55 60 



<210> 37 
<211> 195 
<212> DNA 

<213> Streptoiryces tubercidicus 
<400> 37 

atgcggatca ccatcgacac cgacatctgc atcggcgccg gccagtgcgc cctgaccgcg 60 
ccgggagtct tcacccagga cgacgacggt ttcagcgccc tgctgcccgg ccgcgaggac 120 
ggcgcgggcg acccgctggt gcgcgaggcc gcccgcgcct gccccgtgca ggccatttcg 180 
gtcacggacg actga 195 



<210> 38 
<211> 64 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 38 

Met Arg He Thr He Asp Thr Asp He Cys He Gly Ala Gly Gin Cys 

15 10 15 

Ala Leu Thr Ala Pro Gly Val Phe Thr Gin Asp Asp Asp Gly Phe Ser 

20 25 30 

Ala Leu Leu Pro Gly Arg Glu Asp Gly Ala Gly Asp Pro Leu Val Arg 

35 40 45 

Glu Ala Ala Arg Ala Cys Pro Val Gin Ala He Ser Val Thr Asp Asp 
50 55 60 



<210> 


39 


<211> 


9 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic peptide. 


<400> 


39 



-31 - 
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Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 



1 


5 


<210> 


40 


<211> 


10 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic peptide. 


<400> 


40 



Glu Gin Lys Leu lie Ser Glu Glu Asp Leu 
15 10 



<210> 


41 


<211> 


11 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic peptide. 


<400> 


41 



Gin Pro Glu Leu Ala Pro Glu Asp Pro Glu Asp 
15 10 



<210> 


42 


<211> 


11 


<212> 


PRT 


<213> 


Artificial Sequence 


<220> 




<223> 


Synthetic peptide. 


<400> 


42 



Tyr Thr Asp lie Glu Met Asn Arg Leu Gly Lys 
15 10 



<210> 


43 


<211> 


7 


<212> 


PRT 


<213> 


S treptomyc es 


<220> 




<221> 


mi s c_f eatur e 


<222> 




<223> 


Streptomyces 


<400> 


43 



He Ala Gly His Glu Thr Thr 
1 5 



-32- 



WO 02/092801 
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<210> 44 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc__feature 

<222> (6).. (18) 

<223> Nucleotides 6, 9 and 18 are "s" wherein "s" = g or c. 

<400> 44 

atcgcsggsc acgagacsac 20 



<210> 45 

<211> 7 

<212> PRT 

<213> Streptonr^ces 

<220> 

<221> misc_feature 
<222> 

<223> Streptoirryces consensus sequence 

<400> 45 

Val Ala Gly His Glu Thr Thr 
1 5 



<210> 46 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_jfeature 

<222> (3).. (18) 

<223> Nucleotides 3, 6, 9, and 18 are "s" wherein "s" = g or c. 

<400> 46 

gtsgcsggsc acgagacsac • 20 



<210> 47 

<211> 7 

<212> PRT 

<2 1 3 > S tr ept amyces 

<220> 

<221> misc_f eature 
<222> 

<223> Streptomyces consensus sequence 



<400> 47 

Leu Ala Gly His Glu Thr Thr 
1 5 
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<210> 48 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> inisc_feature 

<222> (3).. (18) 

<223> Nucleotides 3, 6, 9, and 18 are "s" wherein "s" = g or c. 

<400> 48 

ctsgcsggsc acgagacsac 20 



<210> 49 

<211> 9 

<212> PRT 

<213> Streptomyces 

<220> 

<22 1> mi sc__f eatur e 

<222> 

<223> Streptoinyces consensus sequence 



<400> 49 

Leu Leu Leu He Ala Gly His Glu Thr 
1 5 



<210> 50 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 
<222> (2).. (17) 

<223> Nucleotides 2, 5, 8, 14, and 17 are "s" wherein "s" = g or c. 
<400> 50 

tsetse tsat cgcsggscac gagac 25 



<210> 51 

<211> 9 

<212> PRT 

<213> Streptoinyces 

<220> 

<221> misc_feature 
<222> 

<223> Streptoinyces consensus sequence 



<400> 51 

His Gin Cys Leu Gly Gin Asn Leu Ala 
1 5 
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<210> 52 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> mis cofeature 

<222> (12) . . (24) 

<223> Nucleotides 12, 15, and 24 are "s" wherein "s" = g or c. 

<400> 52 

gtggtcacgg asccstgctt ggascg 26 



<210> 53 

<211> 8 

<212> PRT 

<213> Streptomyces 

<220> 

<221> misc__f eature 
<222> 

<223> Streptomyces consensus sequence 



<400> 53 

Phe Gly His Gly Val His Gin Cys 
1 5 



<210> 54 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 

<222> (6).. (15) 

<223> Nucleotides 6, 12, and 15 are "s" wherein "s" = g or c. 

<400> 54 

aagccsgtgc cscasgtggt cacg 24 



<210> 55 

<211> 8 

<212> PRT 

<213> Streptomyces 

<220> 

< 2 2 1> mi s c_f ea tur e 
<222> 

<223> Streptomyces consensus sequence 



<400> 55 
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Phe Gly Phe Gly Val His Gin Cys 



1 


5 




<210> 


56 




<211> 


24 




<212> 


DNA 




<213> 


Artificial Sequence 




<220> 






<221> 


mi s c_f ea tur e 




<222> 


(6) . . (15) 




<223> 


Nucleotides 6, 12, and 


15 are " 


<400> 


56 




aaggcsaagc cscasgtggt cacg 




<210> 


57 




<211> 


8 




<212> 


PRT 




<213> 


Streptorryces 




<220> 






<221> 


mi s c_f eatur e 




<222> 






<223> 


Streptomyces consensus 


sequence 


<400> 


57 




Phe Gly His Gly He His Gin Cys 


1 


5 





wherein "s" - g or c. 



24 



<210> 


58 


<211> 


24 


<212> 


DNA 


<213> 


Artificial Sequence 


<220> 




<221> 


misc__f eature 


<222> 


(6) . . (12) 


<223> 


Nucleotides 6 and 12 


<400> 


58 


aagccsgtgc cstaggtggt cacg 


<210> 


59 


<211> 


8 


<212> 


PRT 


<213> 


Streptomyces 


<220> 




<221> 


mi s c_f eature 


<222> 





are "s" wherein "s" = g or c. 



24 



<223> Streptomyces consensus sequence 



-36- 



WO 02/092801 



PCT/EP02/05363 



<400> 59 

Phe Gly His Gly Val His Phe Cys 
1 5 



<210> 60 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_J:eature 

<222> (6) . . (15) 

<223> Nucleotides 6, 12, and 15 are "s" wherein "s" = g or c. 

<400> 60 

aagccsgtgc cscasgtgaa gacg 24 



<210> 61 
<211> 24 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 61 

His Pro Gly Glu Pro Asn Val Met Asp Pro Ala Leu lie Thr Asp Pro 
15 10 15 

Phe Thr Gly Tyr Gly Ala Leu Arg 
20 



<210> 62 
<211> 21 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 62 

Phe Val Asn Asn Pro Ala Ser Pro Ser Leu Asn Tyr Ala Pro Glu Asp 
15 10 15 

Asn Pro Leu Thr Arg 
20 



<210> 63 
<211> 19 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 63 

Leu Leu Thr His Tyr Pro Asp He Ser Leu Gly He Ala Pro Glu His 
15 10 15 

Leu Glu Arg 



-37- 
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<210> 64 
<211> 17 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 64 

Val Tyr Leu Leu Gly Ser He Leu Asn Tyr Asp Ala Pro Asp His Thr 
15 10 15 

Arg 



<210> 65 
<211> 13 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 65 

Thr Trp Gly Ala Asp Leu He Ser Met Asp Pro Asp Arg 
15 10 



<210> 66 
<211> 13 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 66 

Glu Ala Leu Thr Asp Asp Leu Leu Ser Glu Leu He Arg 
15 10 



<210> 67 
<211> 12 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 67 

Phe Met Asp Asp Ser Pro Val Trp Leu Val Thr Arg 
15 10 



<210> 68 
<211> 12 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 68 

Leu Met Glu Met Leu Gly Leu Pro Glu His Leu Arg 
15 10 



<210> 69 



-38- 
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<211> 11 
<212> PRT 

<213> Streptomyces tubercidicus 
<400> 69 

Val Glu Gin lie Ala Asp Ala Leu Leu Ala Arg 
15 10 



<210> 70 
<211> 11 
<212> PRT 

<213> Streptoirryces tubercidicus 
<400> 70 

Leu Val Lys Asp Asp Pro Ala Leu Leu Pro Arg 
15 10 



<210> 71 

<211> 8 

<212> PRT 

<213> Streptomyces tubercidicus 

<400> 71 

Asp Asp Pro Ala Leu Leu Pro Arg 
1 5 



<210> 72 

<211> 8 

<212> PRT 

<213> Streptomyces tubercidicus 

<400> 72 

Thr Pro Leu Pro Gly Asn Trp Arg 
1 5 



<210> 73 

<211> 7 

<212> PRT 

<213> Streptomyces tubercidicus 

<400> 73 

Leu Asn Ser Leu Pro Val Arg 
1 5 



<210> 74 

<211> 7 

<212> PRT 

<213> Streptomyces tubercidicus 



-39- 
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<400> 74 

lie Thr Asp Leu Arg Pro Arg 
1 5 



<210> 75 

<211> 7 

<212> PRT 

<213> Streptoirryces tubercidicus 

<400> 75 

Glu Gin Gly Pro Val Val Arg 
1 5 



<210> 76 

<211> 7 

<212> PRT 

<213> Streptoirryces tubercidicus 

<400> 76 

Ala Val His Glu Leu Met Arg 
1 5 



<210> 77 

<211> 5 

<212> PRT 

<213> Streptoirryces tiibercidicus 

<400> 77 

Ala Phe Thr Ala Arg 
1 5 



<210> 78 

<211> 5 

<212> PRT 

<213> Streptoirryces tubercidicus 

<400> 78 

Phe Glu Glu Val Arg 
1 5 



<210> 79 

<211> 7 

<212> PRT 

<213> Streptoirryces tubercidicus 

<400> 79 

Pro Gly Glu Asp Asn Val Met 
1 5 



-40- 
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<210> 80 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_feature 

<222> (3) . . (18) 

<223> Nucleotides 3, 6, 12, and 18 are "s" wherein "s" = c or g. 
<220> 

<221> mi s c_f eature 

<222> (9) . . (9) 

<223> Nucleotide 9 is "r" wherein "r" = a or g. 
<220> 

<221> mi sc_f eature 

<222> (15).. (15) 

<223> Nucleotide 15 is "y" wherein "y" = c or t. 

<400> 80 

ccsggsgarc csaaygtsat g 21 



<210> 81 

<211> 7 

<212> PRT 

<213> Streptomyces tuber c i di cus 

<400> 81 

Ala Leu He Thr Asp Pro Phe 
1 5 

<210> 82 

<211> 21 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> mi sc__f eature 

<222> (3).. (18) 

<223> Nucleotides 3, 6, 12, and 18 are "s "wherein "s" = c or g. 



<400> 82 

gcsctsatya csgacccstt c 21 



<210> 83 

<211> 8 

<212> PRT 

<213> Streptomyces tubercidicus 

<400> 83 

Phe Met Asp Asp Ser Pro Val Trp 
1 5 



-41 - 
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<210> 84 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 

<222> (13).. (13) 

<223> Nucleotide 13 is "w" wherein "w" = a or t. 
<220> 

<221> misc_f eature 

<222> (14).. (21) 

<223> Nucleotides 14, 15, 18, and 21 are M s" wherein "s" = c or g. 

<400> 84 

ttcatggacg acwssccsgt stgg 24 



<210> 85 

<211> 8 

<212> PRT 

<213> Streptornyces tubercidicus 

<400> 85 

Leu Asn Tyr Asp Ala Pro Asp His 
1 5 



<210> 86 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> mi sc__f eature 

<222> (3).. (18) 

<223> Nucleotides 3, 15 and 18 are "s" wherein "s" = c or g. 
<220> 

<221> misc__f eature 

<222> (6) . . (9) 

<223> Nucleotides 6 and 9 are "y" wherein "y" = c or t. 

<400> 86 

ctsaaytayg acgcsccsga ccac 24 



<210> 87 

<211> 8 

<212> PRT 

<213> Streptomyces tubercidicus 

<400> 87 

Val Glu Gin He Ala Asp Ala Leu 
1 5 



-42- 
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<210> 88 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> mi s c__f ea tur e 

<222> (3).. (24) 

<223> Nucleotides 3, 15, 21, and 24 are "s" wherein "s" = c or g. 
<220> 

<221> misc_f eature 

<222> (12) . . (12) 

<223> Nucleotide 12 is "y" wherein "y" = c or t. 
<220> 

<221> mi sc_f eature 

<222> (6).. (6) 

<223> Nucleotide 6 is "r" wherein "r" = a or g. 

<400> 88 

gtsgarcaga tygcsgacgc sets 24 



<210> 89 

<211> 8 

<212> PRT 

<213> Streptomyces tubercidicus 

<400> 89 

Asp Leu He Ser Met Asp Pro Asp 
1 5 



<210> 90 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<221> misc_f eature 

<222> (6) . . (21) 

<223> Nucleotides 6, 11, 12, and 21 are "s" wherein "s" = c or g. 
<220> 

<221> misc_f eature 

<222> (9).. (9) 

<223> Nucleotide 9 is "r" wherein "r" = a or g. 
<220> 

<221> mi sc_f eature 

<222> (10).. (10) 

<223> Nucleotide 10 is "w" wherein "w" = a or t. 

<400> 90 

ctggastarw sstacctggg sctg 24 



<210> 91 
<211> 36 
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<212> DNA 

<213> Streptonr/ces tubercidicus 
<400> 91 

agattaatta atgtcggaat taatgaactg tccgtt 36 



<210> 92 
<211> 32 
<212> DNA 

<213> Streptomyces tubercidicus 
<400> 92 

aaactcaccc caaccgcacc ggcagcgagt tc 32 



<210> 93 

<211> 7 

<212> PRT 

<213> Strep tornyees tubercidicus 

<400> 93 

Met Ser Glu Leu Met Asn Ser 
1 5 



<210> 94 
<211> 1293 
<212> DNA 

<213> Streptomyces tiibercidicus 



<400> 94 

atgtcggcaa tatccagctc cccgttcgcc 
aatgtgatgg acccggcgct gatcaccgac 
caaggccccg tcctaccggg ccggttcatg 
ttcgaagagg tccgccaagt cctgcgcgat 
tcaccggggc attcgatcga cgagagcccc 
cccgaacatt tccggccgta tctgatgggg 
acccggctgc gccgtctggt gtcacgcgcg 
ccgcgggtcg agcagctcgc cgacgagctg 
ggtgtggtcg acctgatcaa gcacttcgcc 
ctggtcggca tcccggaagc ggaccgcccg 
tcgctgcagc cggagcggct cagcacctcg 
ctgatccgcg agcggcgcgg cgcgctcacc 
catgacgacg acggcagccg gctcagcgac 
gtcctggccg gccacgagac caccgcccac 
acccaccccg accagctgcg cctggtcaag 
cacgagctgc tgcgctggtg cgggccggtc 
gatgtcgaga tcgccgggac gccgatccgt 
tcggcgaact tcgacccccg ccactacacc 
cccgccggcc acgccgagaa ccatgtgggc 
gccaccctcg ccaaacagga gggcgaagtc 
gagctgtcgc tggccgtcgc accggacgag 



gcacacgtcg gaaagcatcc cggcgagccg 60 
ccgttcggcg gctacggcgc actgcgtgag 120 
gacgactcac ccgtctggct cgtgacgcgc 180 
cagcggttcc tgaacaaccc ggccgcgtcg 240 
acggccaggc tgctggacat gatggggatg 300 
tcgatcctca acaacgacgc ccccgaccac 360 
ttcacggcac gcaagatcac cgatctgcgg 420 
ctggcccggc ttcccgagca cgccgaggac 480 
tatcccctgc cgatcaccgt gatctgcgaa 540 
caatggcgga agtggggcgc cgacctcgtt 600 
ttcccggcga tgatcgagca catccatgaa 660 
gacgatctgc tcagcgagct gatccgtacc 720 
gtcgagatgg tcaccatggt cctcaccgtc 780 
ctgataggca acggcacggc ggcgctgctc 840 
gacgacccgg agctgcttcc gcgtgccgtc 900 
cagatgaccc agctgcggta cgcctccgag 960 
aagggcgacg ccgtacaact catcctggta 1020 
gcccccgaac gcctcgacct gacccgccac 1080 
ttcggccacg gaatgcacta ctgcctgggc 1140 
gcgttcggca agctcttcac gcactacccg 1200 
ttggagcgaa cgccggtgcc cggcagctgg 1260 
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cggttggatt cgctgccggt gcggttgggg tga 1293 



<210> 95 
<211> 430 
<212> PRT 

<213> Streptoiryces tubercidicus 



<400> 95 



Met 
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lllX 


i\rg Xieu 


Arg Arg Leu vax oer 






no 










1 9 O 

XZi u 






IOC 
1Z 3 


Arg 


jf-iXci 


i lit? 


iliX 


Al ^ 


Arg 


Lys 


Tl o 


X11X7 


A T" ~\ 

riSp XieLl 


ax y Jrx o ax y vax oiu 




X~> 










_L J J 








1 Ad 




XjfciLA 




A C?T^ 


ulu 


J_Jfc2LA 


J— ICS LA 


Al ^ 


AfTY 
-"XLJ 


XjfcrU. rxu 


min Hie Al =» nin 7\a-n 
vj-L LI nib jtIXcl ulu z-ii=>ky 


1 

-Lt: J 










1 RO 

_L «J L/ 








1 RR 
-L ^ ~J 


1 fiO 

1UU 




V dX 


V^l 






Tl e 

Xitr 




mo 


Php 
JT lie 


Al THrr 
rua. iyi 


Pvn T i Prn Tl TTtk* 










165 










170 


175 


V^l 

V CLJL 


Tl o 


v_-Jf o 


rzi n 

VJT_L LA 


Leu 


Val 




He 




Gl u Al a 


So-n AT~rr "P-k"o fZl "n Tvn 

^i£D_Ly rlLLJ Jl J- LJ Lj J — L± J. J_ £j 








180 










185 




190 


Arg 


Lys 


Tm 


Gly 


Ala 


Asp 


Leu 


Val 


Ser 


T,pn Qln 


p-rn C4l n At~ct TiP^n ^^t*' 






195 










200 






205 


Thr 


Ser 


Plie 


Pro 


Ala 


Met 


He 


Glu 


His 


He His 


(^Iti T.f=»7i Tl A r~rr rjln 

vU-L LA UC LA _! — L xl_L y VU JL LA 




210 










215 








220 


Arg 


Arg 


Gly 


Ala 


Leu 


Thr 


Asp 


Asp 


Leu 


Leu Ser 


Glu Leu He Arg Thr 


225 










230 








235 


240 


His 


Asp 


Asp 


Asp 


Gly 


Ser 


Arg 


Leu 


Ser 


Asp Val 


Glu Met Val Thr Met 










245 










250 


255 


Val 


Leu 


Thr 


Val 


Val 


Leu 


Ala 


Gly 


His 


Glu Thr 


Thr Ala His Leu He 








260 










265 




270 


Gly 


Asn 


Gly 


Thr 


Ala 


Ala 


Leu 


Leu 


Thr 


His Pro 


Asp Gin Leu Arg Leu 






275 










280 






285 


Val 


Lys 


Asp 


Asp 


Pro 


Glu 


Leu 


Leu 


Pro 


Arg Ala 


Val His Glu Leu Leu 




290 










295 








300 


Arg 


Trp 


Cys 


Gly 


Pro 


Val 


Gin 


Met 


Thr 


Gin Leu 


Arg Tyr Ala Ser Glu 


305 










310 








315 


320 


Asp 


Val 


Glu 


He 


Ala 


Gly 


Thr 


Pro 


He 


Arg Lys 


Gly Asp Ala Val Gin 










325 










330 


335 


Leu 


He 


Leu 


Val 


Ser 


Ala 


Asn 


Phe 


Asp 


Pro Arg 


His Tyr Thr Ala Pro 








340 










345 




350 
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Glu Arg Leu Asp Leu Thr Arg His Pro Ala Gly His Ala Glu Asn His 

355 360 365 

Val Gly Phe Gly His Gly Met His Tyr Cys Leu Gly Ala Thr Leu Ala 

370 375 380 

Lys Gin Glu Gly Glu Val Ala Phe Gly Lys Leu Phe Thr His Tyr Pro 
385 390 395 400 

Glu Leu Ser Leu Ala Val Ala Pro Asp Glu Leu Glu Arg Thr Pro Val 

405 410 415 

Pro Gly Ser Trp Arg Leu Asp Ser Leu Pro Val Arg Leu Gly 
420 425 " 430 



<210> 96 
<211> 18 
<212> DNA 

<213> Artificial Sequence 



<400> 96 

cgsccsccsc tswssaas 18 



<210> 97 
<211> 21 
<212> DNA 

<213> Artificial Sequence 



<400> 97 

sassgcstts bcccartgyt c 21 



<210> 98 
<211> 1266 
<212> DNA 

<213> Streptomyces tubercidicus 



<400> 98 

gtggtcgacg cacaccagac gttcgtcatc gtcgggggtg gcctggccgg cgcaaaggcc 60 
gcggagactc tccgcgcgga ggggttcacc ggccgggtga tcctcatctg tgacgagcgc 120 
gaccacccgt acgagcgccc cccgctctcc aaggggttcc tgctcggcaa ggaagagcgc 180 
gacagcgtgt tcgtccatga gcccgcctgg tacgcccagg cacagatcga actgcacctg 240 
ggccagcccg ccgtccgcct cgaccccgag ggcaggaccg tccgcctcgg cgacggcacc 300 
ctgatcgcct acgacaagct gctgctggcc accggcgccg aaccgcggcg cctggacatc 360 
cccggcaccg gcctggccgg cgtgcaccac ctgcgccgcc tcgcccacgc cgaacggctg 420 
cgcggcgtcc tggcctccct cggccgcgac aacggccatc tggtgatcgc cggagccggc 480 
tggatcggcc tggaggtcgc cgccgcggcc cgctcctacg gcgccgaggt gaccgtcgtc 540 
gaggccgccc cgacgccgct gcacggcatc ctggggcccg aactcggcgg tctgttcacc 600 
gatctgcacc gcgagcacgg cgtccgcttc cacttcggcg cccgcttcac cgagatcgtc 660 
ggagagggcg gcatggtgct cgccgtgcgc accgacgacg gcgaggaaca ccccgcccac 720 
gatgtgctcg ccgcgatcgg cgccgccccg cgcaccgcgc tcgccgaaca ggccgggctg 780 
gatctcgccg acccggagac cggcggcggg gtggccgtcg acgcggcgct gcgcacctcc 840 
gacccgtaca tctacgccgc cggtgacgtc gccgccgccg accacccgct gctggacacc 900 
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cggctgcggg tcgaacactg ggccaacgcc 
atgctcggcc aggacatcag ctacgaccgc 
gtcggcatgg agtactccgg ctacgccccg 
ggcgacgtcg ccaagcggga gttcatcgcc 
gcgggcatga acgtcaacgt ctgggacgtc 
ggggcgccgt tggagcccgg cgcactggcc 
ccgtag 



ctcaacggcg gcccggccgc cgcccgcgcc 960 
atcccgtact tcttctccga ccagtacgac 1020 
cccggctcgt acgcccaggt cgtctgccgc 1080 
ttctggctgg cggcggacgg ccggctgctc 1140 
gccgagtcca tccagcaact catccgctcc 1200 
gatccgcagg ttccgctggc ggcactgctc 1260 

1266 



<210> 99 
<211> 421 
<212> PRT 

<213> Streptorryces tubercidicus 



<400> 99 



Val 


Val 


Asp 


Ala 


His 


Gin 


Thr 


Phe 


1 








5 








Gly 


Ala 


Lys 


Ala 


Ala 


Glu 


Thr 


Leu 








20 










Val 


lie 


Leu 


lie 


Cys 


Asp 


Glu 


Arg 






35 










40 


Leu. 


Ser 


Lys 


Glv 


Phe 


Leu 


Leu 


Glv 




50 










55 




Val 


His 


Glu 


Pro 


Ala 


Trp 


Tyr 


Ala 


65 










70 






Gly 


Gin 


Pro 


Ala 


Val 


Arg 


Leu 


Asp 










85 








Gly 


Asp 


Gly 


Thr 


Leu 


lie 


Ala 


Tyr 








100 










Ala 


Glu 


Pro 


Arg 


Arcr 


Leu 


Asp 


lie 






115 










120 


His 


His 


Leu 


Arg 


Arg 


Leu 


Ala 


His 




130 










135 




Ala 


Ser 


Leu 


Gly 


Arg 


Asp 


Asn 


Gly 


145 










150 






Trp 


lie 


Gly 


Leu 


Glu 


Val 


Ala 


Ala 










165 








Val 


Thr 


Val 


Val 


Glu 


Ala 


Ala 


Pro 








180 










Pro 


Glu 


Leu 


Gly 


Gly 


Leu 


Phe 


Thr 






195 










200 


Arg 


Phe 


His 


Phe 


Gly 


Ala 


Arg 


Phe 




210 










215 




Met 


Val 


Leu 


Ala 


Val 


Arg 


Thr 


Asp 


225 










230 






Asp 


Val 


Leu 


Ala 


Ala 


lie 


Gly 


Ala 










245 








Gin 


Ala 


Gly 


Leu 


Asp 


Leu 


Ala 


Asp 








260 










Val 


Asp 


Ala 


Ala 


Leu 


Arg 


Thr 


Ser 






275 










280 


Asp 


Val 


Ala 


Ala 


Ala 


Asp 


His 


Pro 




290 










295 





Val 


lie Val 


Gly 


Glv 


Glv 


Leu 


Ala 




10 








15 




Arg 


Ala Glu 


Glv 


Phe 


Thr 


Glv 


Arg 


25 








30 






Asp 


His Pro 


Tvr 


Glu 


Arg 


Pro 


Pro 








45 








LyS 


Glu Glu 


Ara 


Asp 


Ser 


Val 


Phe 






60 










Gin 


Ala Gin 


He 


Glu 


Leu 


His 


Leu 




75 










80 


Pro 


Glu Gly 


Arg 


Thr 


Val 


Arg 


Leu 




90 








95 




Asp 


Lys Leu 


Leu 


Leu 


Ala 


Thr 


Gly 


105 








110 






Pro 


Gly Thr 


Gly 


Leu 


Ala 


Gly 


Val 








125 








Ala 


Glu Arg 


Leu 


Arg 


Gly 


Val 


Leu 






140 










His 


Leu Val 


He 


Ala 


Gly 


Ala 


Gly 




155 










160 


Ala 


Ala Arg 


Ser 


Tyr 


Gly 


Ala 


Glu 




170 








175 




Thr 


Pro Leu 


His 


Gly 


He 


Leu 


Gly 


185 








190 






Asp 


Leu His 


Arg 


Glu 


His 


Gly 


Val 








205 








Thr 


Glu He 


Val 


Gly 


Glu 


Gly 


Gly 






220 










Asp 


Gly Glu 


Glu 


His 


Pro 


Ala 


His 




235 










240 


Ala 


Pro Arg 


Thr 


Ala 


Leu 


Ala 


Glu 




250 








255 




Pro 


Glu Thr 


Gly 


Gly 


Gly 


Val 


Ala 


265 








270 






Asp 


Pro Tyr 


He 


Tyr 


Ala 


Ala 


Gly 








285 








Leu 


Leu Asp 


Thr 


Arg 


Leu 


Arg 


Val 



300 
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Glu 


His 




Ala 


Asn 


Ala 


Leu 


Asn 


Gly 


Gly 


Pro 


Ala 


Ala 


Ala 


Arg 


Ala 


305 










310 










315 










320 


Met 


Leu 


Gly 


Gin 


Asp 
325 


He 


Ser 


x y ± - 


Asp 


Arg" 
330 


He 


Pro 


Tvr 


Phe 


Phe 
335 


Ser 


Asp 


Gin 


Tvr 


Asp 
340 


Val 


Glv 


Met 


Glu 


TVr 
345 


Ser 


Glv 


Tvr 


Ala 


Pro 
350 


Pro 


Glv 


Ser 




Ala 
355 


Gin 


Val 


Val 




360 


Glv 


Asp 


Val 


Ala 


Lys 
365 


Arg 


Glu 


Phe 


He 


Ala 
370 


Phe 


Trp 


Leu 


Ala 


Ala 
375 


Asp 


Gly 


Arg 


Leu 


Leu 
380 


Ala 


Gly 


Met 


Asn 


Val 


Asn 


Val 


Trp 


Asp 


Val 


Ala 


Glu 


Ser 


lie 


Gin 


Gin 


Leu 


He 


Arg 


Ser 


385 










390 










395 










400 


Gly 


Ala 


Pro 


Leu 


Glu 
405 


Pro 


Gly 


Ala 


Leu 


Ala 
410 


Asp 


Pro 


Gin 


Val 


Pro 
415 


Leu 


Ala 


Ala 


Leu 


Leu 


Pro 

























420 



<210> 100 
<211> 1314 
<212> DNA 

<213> Streptomyces tubercidicus 



<400> 100 

atgcccgctg cacgccgccg ccttcgacct ccgcaccgga gcggcgacct gcctgcccgc 60 
ccgccgggcc gtgcgcaccc accccgtgac cgtccaggac ggcatgatct acgtccatca 120 
cgccgcggag gagggcaccg ccgcatgaag- tcggtcgctg tcatcggggc ctcgctggcg 180 
ggcctgtacg ccgcgcggtc cctgcgttcc caggggttcg acggccgcct ggtgatcgtc 240 
ggggacgagt gccacggccc ctacgaccgg cccccgctgt ccaaggactt cctcaccggc 300 
gccaccgacc cgggccgact cgccctggcc gacgccgagg agatcgccga actcgacgcc 360 
gaatggctgc tgggcacccg ggccaccggg ctcgacaccg gcggacgcac ggtgctgctc 420 
gatggcggcc ggtccctgac caccgacggc gtggtcctcg ccaccggcgc cgccccgcgc 480 
ctgctccccg gaccggtgcc cgccggggtc cacaccctgc gcaccctcga cgacgcccag 540 
gcgctccgtg cggatctggc gccggcgccg gtccgggtcg tggtgatcgg cggcggcttc 600 
atcggcgccg aggtcgcctc gtcctgcgcc gccctaggcc atgacgtcac cgtggtcgag 660 
gccgcgccgc tccccctcgt cccccaactc ggccacgcca tggccgagat ctgcgccgcc 720 
ctg"catgcgg accacggcgt cacgctgctc accggaaccg gtgtcgcccg gctgcgcagc 780 
gagggcgacg gccggcgcgt caccggcgtc gagctgaccg acggccgcct gctccccgcc 840 
gacgtggtcg tcgtcggcat cggggtacgc ccccgcaccg cctggctcac ggactccgga 900 
ctgccgctcg acgacggtgt gctctgcgac gcgggctgtg tcaccccgct gcccgccgtc 960 
gtggccgtcg gcgacgtcgc cagggtggac ggcgcccgtg ccgagcactg gaccagcgcc 1020 
accgaacagg ccgccgtggc ggcgcggaac ctgctggccg gcagcaccgt cgcgacccac 1080 
cggagcctgc cgtacttctg gtccgaccag tacggcgtcc gcatccagtt cgcgggccac 1140 
cggctgccca ccgacacacc gcgcgtcctc gaaggctccc ccgacgaccg cagcttcctc 1200 
gcctgttacg aacgggacgg acgcaccacc gcggtgctcg ccctcaaccg gccccgcccc 1260 
ttcatgcggc tccgccgcga actcgcccgc accgccctgt cggccaccac ctga 1314 



<210> 101 
<211> 437 
<212> PRT 

<213> Streptorryces tubercidicus 
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<400> 101 

Met Pro Ala Ala Arg Arg Arg Leu Arg Pro Pro His Arg Ser Gly Asp 

15 10 15 

Leu Pro Ala Arg Pro Pro Gly Arg Ala His Pro Pro Arg Asp Arg Pro 

20 25 30 

Gly Arg His Asp Leu Arg Pro Ser Arg Arg Gly Gly Gly His Arg Arg 

35 40 45 

Met Lys Ser Val Ala Val lie Gly Ala Ser Leu Ala Gly Leu Tyr Ala 

50 ' 55 60 

Ala Arg Ser Leu Arg Ser Gin Gly Phe Asp Gly Arg Leu Val He Val 
65 70 75 80 

Gly Asp Glu Cys His Gly Pro Tyr Asp Arg Pro Pro Leu Ser Lys Asp 

85 90 95 

Phe Leu Thr Gly Ala Thr Asp Pro Gly Arg Leu Ala Leu Ala Asp Ala 

100 105 110 

Glu Glu He Ala Glu Leu Asp Ala Glu Trp Leu Leu Gly Thr Arg Ala 

115 120 125 

Thr Gly Leu Asp Thr Gly Gly Arg Thr Val Leu Leu Asp Gly Gly Arg 

130 135 140 

Ser Leu Thr Thr Asp Gly Val Val Leu Ala Thr Gly Ala Ala Pro Arg 
145 150 155 160 

Leu Leu Pro Gly Pro Val Pro Ala Gly Val His Thr Leu Arg Thr Leu 

165 170 175 

Asp Asp Ala Gin Ala Leu Arg Ala Asp Leu Ala Pro Ala Pro Val Arg 

180 185 190 

Val Val Val He Gly Gly Gly Phe He Gly Ala Glu Val Ala Ser Ser 

195 200 205 

Cys Ala Ala Leu Gly His Asp Val Thr Val Val Glu Ala Ala Pro Leu 

210 215 220 

Pro Leu Val Pro Gin Leu Gly His Ala Met Ala Glu He Cys Ala Ala 
225 230 235 240 

Leu His Ala Asp His Gly Val Thr Leu Leu Thr Gly Thr Gly Val Ala 

245 250 255 

Arg Leu Arg Ser Glu Gly Asp Gly Arg Arg Val Thr Gly Val Glu Leu 

260 265 270 

Thr Asp Gly Arg Leu Leu Pro Ala Asp Val Val Val Val Gly He Gly 

275 280 285 

Val Arg Pro Arg Thr Ala Trp Leu Thr Asp Ser Gly Leu Pro Leu Asp 

290 295 300 

Asp Gly Val Leu Cys Asp Ala Gly Cys Val Thr Pro Leu Pro Ala Val 
305 310 315 320 

Val Ala Val Gly Asp Val Ala Arg Val Asp Gly Ala Arg Ala Glu His 

325 330 335 

Trp Thr Ser Ala Thr Glu Gin Ala Ala Val Ala Ala Arg Asn Leu Leu 

340 345 350 

Ala Gly Ser Thr Val Ala Thr His Arg Ser Leu Pro Tyr Phe Trp Ser 

355 360 365 

Asp Gin Tyr Gly Val Arg He Gin Phe Ala Gly His Arg Leu Pro Thr 

370 375 380 

Asp Thr Pro Arg Val Leu Glu Gly Ser Pro Asp Asp Arg Ser Phe Leu 
385 390 395 400 

Ala Cys Tyr Glu Arg Asp Gly Arg Thr Thr Ala Val Leu Ala Leu Asn 
405 410 415 
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Arg Pro Arg Pro Phe Met Arg Leu Arg Arg Glu Leu Ala Arg Thr Ala 

420 425 430 

Leu Ser Ala Thr Thr 
435 



<210> 102 
<211> 1233 
<212> DNA 

<213> Streptomyces tuber cidicus 



<400> 102 

atggcccaga acacggcatt catcatcgcg 
gagacactgc gcgcggaggg cttcggcggc 
cgtccctacg agcggccgcc gctgtccaag 
aaggcgtacg tccatccgcc ccagtggtac 
aacgccgtca ccgccctcga cccggccggc 
ctgggctacg ccaagctgct gctggccacc 
ggcgccgacc tcgacggggt ccacacgctg 
gacctcttcc ggtccgcgtc ccggatcgtg 
accacggccg ccgcgcgtgc ggcgggggtc 
cccctgctgg gggtgctggg ccgcgaggtc 
cacggtgtcg cgctgcgctg cgacacccag 
gtcgacgggg tacggctggc cgacggcacc 
gtcgggatca cccccaactc cgagacggcc 
gtcgtcgtgg acgagcggct gtgctcctcc 
gccaacgcct accaccccct cctgggcaag 
ctccaccagc cgaagaccgc ggcccgggcc 
ctgccgtact tcttcaccga ccagtacgac 
ccgggcgggt acgaccgcgt ggtgttccgc 
ttctggctct ccggcggccg ggtgctggcc 
gacccgatcc gggccctggt ggcgagcggg 
gcggacgtac cgctggcgga tctggtcccc 



ggagcggggc tggccggggc gaaggccgcg 60 
cccgtcctgc tgctgggcga cgagcgcgag 120 
ggctacctct tgggcacctc cgagcgggag 180 
gccgagcacg acgtcgatct gcggctgggc 240 
cacgaggtga ccctcgccga cggcagccgg 300 
ggctccactc cgcgccggct gccggtgccc 360 
cggtacctgg cggacagcga ccgcctcaag 420 
gtgatcggcg gcggctggat cggcctggag 480 
gaggtgaccg tgctggagtc ggcgccgctg 540 
gcccaggtct tcgccgatct gcacaccgag 600 
gtcacggaga tcaccggcac gaacggcgcg 660 
cggatcgcgg ccgacgcggt gatcgtcggc 720 
gccgcggccg ggctcaaggt cgacaacggc 780 
cacccggaca tctacgccgc cggcgacgtc 840 
cacctccgcg tcgagcactg ggccaacgcc 900 
atgctgggcg gggaggccgg ctacgaccgg 960 
ctgggcatgg agtacacggg gcatgtggag 1020 
ggcgacaccg gtgcccgcga gttcatcgcc 1080 
gggatgaatg tgaacgtatg ggacgtcacc 1140 
cgggccgtgg accccgagcg gctcgccgac 1200 
tga 1233 



<210> 103 
<211> 410 
<212> PRT 

<213> Streptomyces tubercidicus 



<400> 103 

Met Ala Gin Asn Thr Ala Phe lie lie Ala Gly Ala Gly Leu Ala Gly 

15 10 15 

Ala Lys Ala Ala Glu Thr Leu Arg Ala Glu Gly Phe Gly Gly Pro Val 

20 25 30 

Leu Leu Leu Gly Asp Glu Arg Glu Arg Pro Tyr Glu Arg Pro Pro Leu 

35 40 45 

Ser Lys Gly Tyr Leu Leu Gly Thr Ser Glu Arg Glu Lys Ala Tyr Val 

50 55 60 

His Pro Pro Gin Trp Tyr Ala Glu His Asp Val Asp Leu Arg Leu Gly 
65 70 ~ 75 80 

Asn Ala Val Thr Ala Leu Asp Pro Ala Gly His Glu Val Thr Leu Ala 
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85 



Asp 


Gly 


Ser 


Arg 


Leu 


Glv Tvx 


Ala 








100 








Thr 


Pro 


Arg 


Arg 


Leu 


Pro Val 


Pro 






115 








120 


Thr 


Leu 


Arg 


Tyr 


Leu 


Ala Asp 


Ser 




130 








135 




Ser 


Ala 


Ser 


Arg 


He 


Val Val 


He 


145 










150 




Thr 


Thr 


Ala 


Ala 


Ala 


Arg Ala 


Ala 










165 






Ser 


Ala 


Pro 


Leu 


Pro 


Leu Leu 


Gly 








180 








Val 


Phe 


Ala 


Asp 


Leu 


His Thr 


Glu 






195 








200 


Thr 


Gin 


Val 


Thr 


Glu 


He Thr 


Gly 




210 








215 




Arcr 


Leu 


Ala 


Asp Gly 


Thr Arg 


He 


225 










230 




Val 


Gly 


He 


Thr 


Pro 


Asn Ser 


Glu 










245 






Val 


Asp 


Asn 


Gly Val 


Val Val 


Asp 








260 








Asp 


lie 


Tvx 


Ala 


Ala 


Gly Asp 


Val 






275 








280 


Glv 


Lys 


His 


Leu Arg 


Val Glu 


His 




290 








295 




Lys 


Thr 


Ala 


Ala Arg 


Ala Met 


Leu 


305 










310 




Leu 


Pro 


Tvr 


Phe 


Phe 


Thr Asp 


Gin 










325 






Gly 


His 


Val 


Glu 


Pro 


Gly Gly 


Tyr 








340 








Thr 


Gly 


Ala 


Arg 


Glu 


Phe He 


Ala 






355 








360 


Leu 


Ala 


Gly 


Met 


Asn 


Val Asn 


Val 




370 








375 




Ala 


Leu 


Val 


Ala 


Ser 


Gly Arg 


Ala 


385 










390 




Ala 


Asp 


Val 


Pro 


Leu 


Ala Asp 


Leu 



405 





90 




95 


Lys 


Leu 


Leu Leu Ala 


Thr Glv Ser 

•J- X J— I— VJJ. l—f \^ml 


105 






110 


Glv 


Ala 


Asd Leu Asd 


Glv Val His 






125 




Asp 


Arg 


Leu Lys Asp 


Leu Phe Arcr 






140 




Glv 


Glv 


Gly Trp He 


Glv Leu Glu 






155 


160 


Glv 


Val 


Glu Val Thr 


Val Leu Glu 




170 




175 


Val 


Leu 


Glv Arcr Glu 


Val Ala Gin 

v u.jl rxi-w v_rxii 


185 






190 


His 


Glv 


Val Ala Leu 


Arcr C*vs Asd 






205 




Thr 


Asn 


Glv Ala Val 

VJ-LJf nj-Ll V (_X_I_ 


Asd Glv Val 






220 




Ala 


Ala 


Ast) Al a Val 


He Val Glv 






235 


240 


Thr 


Ala 


Ala Al a Al a 

xj — L C*. rl± CI. xj — L d 


fill \r T ,en i T A/c* 




250 




255 


Glu 


A t~ct 


TtP*!! (~\/C« C?q-v~ 


Qpr- T-T-5 a Prn 

OCX XxJLto XT JL LJ* 


265 






270 


Ala 


Asn 


Al a TYtt* Hi q 


TD-r-f^) T.qii T.OII 






285 






Al a 


A C«T1 Ala T • 1 


T-H flT -n Pro 






300 




m v 


Glv 


Qlii Ala Gl A7 


xya. -Moky x^ij_y 






315 


320 


Tvr 


Asp 


Leu Gl v Me=>t* 


Qln TK/-r T~hr" 




330 




335 


Asp 


Arcr 

xxi_ y 


Val Val Phe 


Aytt f^l A/ A«=?T> 


345 






350 


ITJ.lt: 




-1— it; U_ Oci. \jT_L_y 


vjj_L_y i-xxy veil 






365 




Trp 


Asp 


Val Thr Asp 


Pro He Arg 






380 




Val 


Asp 


Pro Glu Arg 


Leu Ala Asp 






395 


400 


Val 


Pro 








410 







<210> 104 
<211> 1266 
<212> DNA 

<213> Streptoinyces tubercidicus 



<400> 104 

gtggtcgacg cacaccagac gttcgtcatc 
gcggagactc tccgcgcgga agggttcacc 
gaccacccgt acgagcgccc cccgctctcc 
gacagcgttt tcgtccacga acccgcctgg 



gtcgggggtg gcctggccgg cgcaaaggcc 60 
ggccgggtga tcctcatctg tgacgagcgc 120 
aaggggttcc tgctcggcaa ggaagagcgc 180 
tacgcccagg cacagatcga actgcacctg 240 
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ggccagcccg ccgtccgcct cgaccccgag 
ctgatcgcct acgacaagct gctgctggcc 
cccggcaccg gcctggccgg cgtgcaccac 
cgcggcgtcc tggcctccct cgggcgggac 
tggatcggcc tggaggtcgc cgccgcggcc 
gaggccgccc cgacaccgct gcacggcatc 
gaactgcacc gcgcacacgg cgtgcgcttc 
ggacaggacg gcatggtgct cgccgtgcgc 
gacgtgctcg ccgcgatcgg cgccgccccg 
gacctcgccg acccggaggc cggcggcggc 
gacccgtaca tctacgccgc cggcgacgtg 
cggctgcgcg tcgaacactg ggccaacgcc 
atgctcggcc aggacatcag ctacgaccgc 
gtcggcatgg agtactccgg ctacgccccg 
ggcgacgtcg ccaaacggga gttcatcgcg 
gcggggatga acgtcaacgt ctgggacgtc 
ggggtgcggt tggagcccgg cgagctggct 
ccgtag 



gcgaagaccg tccgcctcgg cgacggcacc 300 
accggcgccg agccgcgccg cctggacatc 360 
ctgcgccgcc tcgcccacgc cgaacggctg 420 
aacgggcatc tggtgatcgc cggcgccggc 480 
cgctcctacg gcgccgaggt caccgtcgtc 540 
ctggggcccg aactcggcgg cctgttcacc 600 
cacttcggcg cccgtttcac cgagatcgtc 660 
accgacgacg gcgaggagca ccccgcccac 720 
cgcaccgcac tcgccgaaca ggccggactc 780 
gtggccgtcg acgcgacgct gcgcacctcc 840 
gccgccgccg accaccccct cctggacacc 900 
ctcaacggcg gcccggccgc cgcgcgcgcc 960 
gtcccgtact tcttctccga ccagtacgac 1020 
cccggctcct acgcacaggt cgtctgccgc 1080 
ttctggctcg gcgaggacgg acggctgctc 1140 
gccgaaacca tccagcaact catccgcggc 1200 
gatccggagg ttccgctgac ctcactgctc 1260 

1266 



<210> 105 
<211> 421 
<212> PRT 

<213> Streptoiryces tubercidicus 



<400> 105 

Val Val Asp Ala His Gin Thr Phe Val lie Val Gly Gly Gly Leu Ala 

1 5 10 15 

Gly Ala Lys Ala Ala Glu Thr Leu Arg Ala Glu Gly Phe Thr Gly Arg 

20 25 30 

Val lie Leu lie Cys Asp Glu Arg Asp His Pro Tyr Glu Arg Pro Pro 

35 40 45 

Leu Ser Lys Gly Phe Leu Leu Gly Lys Glu Glu Arg Asp Ser Val Phe 

50 55 60 

Val His Glu Pro Ala Trp Tyr Ala Gin Ala Gin lie Glu Leu His Leu 
65 70 75 80 

Gly Gin Pro Ala Val Arg Leu Asp Pro Glu Ala Lys Thr Val Arg Leu 

85 90 95 

Gly Asp Gly Thr Leu lie Ala Tyr Asp Lys Leu Leu Leu Ala Thr Gly 

100 105 110 

Ala Glu Pro Arg Arg Leu Asp lie Pro Gly Thr Gly Leu Ala Gly Val 

115 120 125 

His His Leu Arg Arg Leu Ala His Ala Glu Arg Leu Arg Gly Val Leu 

130 135 140 

Ala Ser Leu Gly Arg Asp Asn Gly His Leu Val lie Ala Gly Ala Gly 
145 150 155 160 

Trp lie Gly Leu Glu Val Ala Ala Ala Ala Arg Ser Tyr Gly Ala Glu 

165 170 175 

Val Thr Val Val Glu Ala Ala Pro Thr Pro Leu His Gly He Leu Gly 

180 185 190 

Pro Glu Leu Gly Gly Leu Phe Thr Glu Leu His Arg Ala His Gly Val 

195 200 205 

Arg Phe His Phe Gly Ala Arg Phe Thr Glu He Val Gly Gin Asp Gly 
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210 






215 


Met 


Val 


Leu 


Ala 


V«=5l Atct TTrr A^m 


225 








230 


Asp 


Val 


Leu. 


Ala 


Ala He Glv Ala 










245 


Gin 


Ala 


Gly 


Leu 


Aqn TiPm Al Fi A^r* 

AOU XJ^^U. ZaXU iidjs/ 








260 




Val 


Asp 


Ala 


Thr 


Ti^n At~ct ThT* Qpt* 






275 




280 


Asp 


Val 


Ala 


Ala 


Ala Asn His Pro 




290 






295 


Glu 


His 




Ala 


/IDll Xi-L CI JJC LX xVDll 


305 








310 


Met 


Leu. 


Gly 


Gin 


A«=i*r> Tl P» Qf^"r~ TVtk- 










325 


7\c?-i-\ 


VJJ-X J. 




A csr-» 










340 






T\rr~ 
lyjL 


Al Pi 




V^l V^l fVq A-rrr 






j jj 




JUU 


j_j_e 


/\j_a 


irlie 


Trp 


J_reU vi±y V7-LU Asp 




370 






375 


Val 


Asn 


Val 


Trp 


Asp Val Ala Glu 


385 








390 


Gly 


Val 


Arg 


Leu 


Glu Pro Gly Glu 










405 


Thr 


Ser 


Leu 


Leu 


Pro 








420 





220 



Asp Gly Glu Glu 


His Pro Ala 


His 


235 




240 


Ala Pro Arg Thr 


Ala Leu Ala 


Glu 


250 


255 




Pro Glu Ala Gly 


Gly Gly Val 


Ala 


265 


270 




Asp Pro Tyr He 


Tyr Ala Ala 


Glv 




285 




Leu Leu Asp Thr 


Arg Leu Arg 


Val 


300 






Gly Gly Pro Ala 


Ala Ala Arg 


Ala 


315 




320 


Asp Arg Val Pro 


Tyr Phe Phe 


Ser 


330 


335 




Tvr Ser fflv Tvr 


Al Fi "Pt*0 Prn 

JL CX -L _1_ w XT Ju ^ 


en v 

Vj7_I__y 








Gly Asp Val Ala 


Lys Arg Glu 


Phe 




365 




Gly Arg Leu Leu 


Ala Gly Met 


Asn 


380 






Thr He Gin Gin 


Leu He Arg 


Gly 


395 




400 


Leu Ala Asp Pro 


Glu Val Pro 


Leu 


410 


415 
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BUDAPEST TREAT* ON THE INTEUNAT X ON Ali RECOGNITION Of TH£ DEPOSIT 
OF MICROORGANISMS FOR THE PURPOSE OF PATENT PROCEDURES 



TO 



syngenta 

3054 CojrnwalXiB Road 

Research Triangle Park, NC 27709 



NAME AND ADDRESS OF THE PARTY TO WHOM 
THE VIABILITY STATEMENT IS ISSUED 



INTERNATIONA!. FORM 

VlfiPHiITE. STfVFBMBNX 



issued pursuant to Rule 10.2 by the 
INTERNATIONAL DEPOSITARY AtfTHORITr 
identified afc the bottom of thle rage 



I* DEPOSITOR 



II. IDENTIFICATION OF THE MlCROOggAKISH 



Name: Syngenta 

305 4 Cornwall Is Road 
Address: Research Triangle Park, 

NC 27709 



Depositor's taxonomic designation ana 
accession number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY s 
streptomyc&a livdtians NRRL 3047ft 

Date oft May &, 2001 
2 Original Deposit 

□ 1 New Deposit 

□ 2 Bepropagation of original Deposit 



III- (a) VIABILITY STATEMENT 
Deposit was found: Viable □ 



Nonviable on May 10, 2001 (Data) 
International Depositary Authority's preparation was found viable on May je, ^ron^at©) 5 



XXX- (p) DEPOSITOR'S EQUIVALENCY DECLARATION 



Depositor determined the International Depositary Authority's preparation was 

j2 4 Equivalent IZL 2 Not equivalent to deposit on ^3 / ^ — - - - (Date) 

Signature of Depositor - 



IV- CONDITIONS UNDER WHICH THE VIABILITY TEST WAS PERFORMED f Depositors /Depositary ) * 



V* INTERNATIONAL DEPOSITARY AUTHORITY 



Narnex Agricultural Besoarch Culture 
Collection (NRRL) 
International Depositary Authority 

Address? 1815 N - University street 

Peoria, Illinois 61604 U.S,A, 



Signature (s) of person(B) having the power 
to represent the International Depositary 
Authority or of authorized off icl.al (a) i 



Pates 




* Indicate the date o£ the original depoait or when a new deposit has been mado. 
7 Mark, with a cross the applicable bo*. 

* In the c*3S3 referred to in Rule 10.2 UHH) a^d till) , ce^ec bo tho moat recent viability test. 

* rill in if the information has been requested. 
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BUDAPEST TREAT* ON THE INTERNATIONAL RECOGNITION OF THE DEPOSIT 
OF MICROORGANISMS FOfc THE PURPOSE OF PATENT PROCEDURES 



TO 



Syngenta 

3054 CornwalliB Road 

Research Triangle Park, NC 27709 



NAME AND ADDRESS OF THE PARTY TO WHOM 
THE VIABILITr* STATEMENT IS ISSUED 



INTERNATIONAL FORM 

VIABIL ITY STATEMENT 



issued pursuant to ftule 10.2 by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
identified at the bottom of this page 



DEPOSITOR 



II, IDENTIFICATION OF THE MICROOaGAKjSM 



Nam© i 
Address : 



Syngantft 

3054 CornwalliB Road 
Research Triangle ParX, 
NC 27709 



Depositor ' s - taxonornic designation ana 
accession number given by the 
INTERNATIONAL DEPOSITARY AUTHORITY 
Pseudorflonas puz±d& NRRL &-30479 

Data of: May 8, 2001 
7 Original Deposit 

□ New Deposit 

□ 2 ^propagation of Original Deposit 



III. (a) VIABILITY STATEMENT 



Deposit was found* ED viable D Nonviable on Hay 10 , 2Q01 (Date) 

International Depositary Authority's preparation was found viable on riay 16 aoonD^te) 5 



III, <b) DEPOSITOR'S EQUIVALENCY DECLARATION 



Depositor determined the International Depositary Authority's preparation was 
Equivalent ^ Not equivalent to deposit on 

, (Date) 



signature of Depositor 



IV, CONDITIONS UNDER WHICH ^HE VIABILITY TEST WAS PERFORMED (Depositors/Depositary) ' 



INTERNATIONAL DEPOSITARY AUTHORITY 



Name: Agricultural Reeearch Culture 
Collection (NRRL) 
International Depositary Authority 

Address: 1615 University street 

Peoria, xllinojg _ $1604 U- s T a» 



Signature (s) of per sort (s) having the power 

to represent the international Depositary 

Authority or of authorized official (e)s 

Datei (? s SJiZ^l 



Indicate the date of the original deposit oc when a new depoait has been made. 
Marie ulth a ctoaa the applicable bp*. 

In the cases referred to in Puis 10.2(a) (ill and {iii) # refer to the jnost recent viability teat. 
Fill In l£ the infonna tion has bean requested. 



